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Wideband, Bidirectional Lightguide 
Communication With an Optically Powered 
Audio Channel 


By R. C. MILLER, B. C. De LOACH, T. S. STAKELON, and 
R. B. LAWRY 


(Manuscript received December 29, 1981) 


The feasibility of bidirectional speech-television communication 
over a single-strand lightguide, with emergency optical powering of 
the remote station telephone, has been demonstrated. The remote 
circuits drew ~250 pA at 0.9 volt from a double-heterostructure 
photovoltaic detector at 0.65 mW of received optical power; the 
detector low-pass response into 50 ohms was down 6 dB at 60 MHz. 
The optical packages featured GRIN-lens bidirectional couplers and, 
at the remote station, a low-threshold-current, buried-heterostructure 
laser. 


I. INTRODUCTION 


Lightguides connecting remote customer stations to a central switch- 
ing office would make wideband telecommunications available to large 
markets. A two-wavelength, bidirectional link utilizing single-strand 
optical fiber might be economical compared to rechargeable batteries 
or auxiliary metallic conductors, particularly if used with an optical 
detector capable of powering the telephone channel during electric 
utility outage at a remote station. The feasibility of incorporating 
emergency optical powering into a wideband, two-way optical link is 
described below. 

Both wideband and optically powered services can be realized with 
the GaAlAs photovoltaic diode used in previous demonstrations of 


1359 


optically powered speech signalling’ and sound alerting” at a remote 
station. This detector provides efficient photovoltaic conversion® along 
a high-resistance (3-kilohm) load line and short-circuit frequency 
response extending into the television carrier range. Since their capac- 
itances are at least an order of magnitude larger than those of reverse- 
biased p-i-n diodes of comparable area, these devices must be used 
with low-input-impedance circuits at large bandwidths. This reduced 
high-frequency sensitivity is compensated for, at least in part, by the 
high light levels used in photovoltaic applications. 


ll. SIGNAL FORMAT AND OPTICAL METHODS 


The block diagram of Fig. 1 identifies major components of the 
demonstration signalling stations. Each station transmitted and re- 
ceived a complete television channel and an audio bandwidth Fm 
channel via a single-strand fiber lightguide. The local station depended 
entirely on electric utility power and the remote station only partly so. 
If utility power failed at the remote station, bidirectional speech and 
sound-alert capabilities were automatically maintained by photovol- 
taic conversion of optical power arriving from the local station. The 
local-station laser was continuously operated at high enough powers 
to ensure these capabilities regardless of how the remote station was 
being powered. 

The remote-station laser was a low-operating-current, GaAs, buried- 
stripe heterostructure (BSH), producing 50 to 100 microwatts of single- 
ended power at wavelength Az = 0.885 pm, predominately in the lowest- 
order spatial modes; the local-station laser was a planar stripe GaAlAs 
double heterostructure emitting up to 3 mW at wavelength A; = 0.84 
p.m. The FM and TV signals, produced by microphone (MIC)-modulated 
variable-frequency oscillators (vFos) and video-cassette recorders (Tv- 
3) or color-bar generators (TV-4), were superimposed on adjustable 
laser bias currents. The laser emissions were focussed into short fiber 
pigtails by high-refractive-index glass microlenses and delivered to the 
transmission lightguide by GRin-lens bidirectional couplers.* Each 
coupler utilized colinear quarter-period lenses, separated by a dielectric 
multilayer of high reflectivity at A; and high transmission at Az, to 
obtain the wavelength combinations shown by the dashed line in 
Fig. 1. 

Light of wavelength A, was directed onto a GaAs double-hetero- 
structure photodetector in the remote station. High-frequency photo- 
current was capacitor-coupled to the FM and TV receivers, while the 
photovoltaic dc component provided steady-state operation of the FM 
receiver and of a low-duty-factor, pulse-amplitude-modulation (PAM) 
transmitter. In the absence of electric utility power, speech transmis- 
sion switched automatically to this PAM standby channel. The micro- 
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Fig. 1—Block diagram of optics and circuits in local and remote stations. 


phone and speaker were housed in a telephone handset. When the 
handset was on-hook, a photovoltaic-powered sound alert could be 
activated by a manually controlled switch in the local station. Optical 
signals of wavelength A: arriving at the local station were directed onto 
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a silicon avalanche photodetector (APD) connected via an impedance- 
matching transistor to the Tv, FM, and PAM receivers. 


lll. IMPLEMENTATION 


The quantum efficiencies and photovoltaic form factors measured 
at low frequencies for remote-station detectors were comparable with 
values reported? for similarly processed devices of the same nominal 
layer dopings and thicknesses. Design variations in the p-side contact 
diameter and in the diameter and depth of the proton-bombarded 
region produced device capacitances ranging between 30 and 1000 pF. 
Detectors near the lower end of this capacitance range had the typical 
small-signal frequency response illustrated in Fig. 2 and also exhibited 
short-circuit, pulsed-current decay times of 5 to 10 ns, consistent with 
reported’® electron lifetimes. Although photovoltaic double-hetero- 
structure diodes are not optical detectors of choice in low-light-level, 
wideband applications, they provided acceptable reception of Tv chan- 
nels 3 and 4 at prevailing light levels when connected to untuned 75- 
ohm loads. Dispersion was not a factor in limiting the quality of 
television reception for the 0.5-km to 1.5-km-length graded-index light- 
guides used during various phases of this work: 

Frequency modulation at 10.0 MHz was chosen for the telephony 
channel to maintain compatibility with simultaneous television recep- 
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Fig. 2—Frequency response of GaAs photovoltaic detector with 50-ohm load. 
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tion, and particularly to avoid audio-frequency cross modulation of the 
TV displays. Strong FM limiting was provided at the local-station, 
variable-frequency oscillator in lieu of expending photovoltaic power 
to achieve amplitude limiting at the remote-station receiver. The 
receiver utilized a one-transistor FM — AM slope detector to obtain 
linear demodulation over a +50 kHz range, followed by a two-transistor 
audio amplifier which produced comfortable sound volume (approxi- 
mately 0.3 volt peak-to-peak) in a 600-ohm earphone. The receiver 
drew about 70 pA at 0.90 volt from the photovoltaic detector. During 
optically powered operation, the PAM transmitter produced 25- to 40- 
mA current pulses of 100-ns duration in the remote station laser at 12- 
kHz repetition rate. 

The quarter-period graded-refractive-index (GRIN) lenses* used in 
the bidirectional couplers were 4.0 mm long and 1.5 mm in diameter. 
They were aligned along a common optical axis and cemented to a 1.1- 
mm-thick glass flat on which a long-wavelength-pass, sharp-transition, 
dielectric-multilayer filter had been deposited. (The filters were sup- 
plied by Optical Coating Laboratories, Inc. Design procedures for this 
type of edge filter are summarized by H. A. MacLeod.’) Three 100-cm- 
long, connectorized, lightguide pigtails were manipulated into optimum 
coupling position in a sequence that allowed some compensation for 
previous component misalignment, and were affixed to the GRIN-lens 
faces with ultraviolet-curing cement. The insertion losses versus wave- 
length of a typical bidirectional coupler, epoxied into an aluminum 
housing for protection, are shown in Fig. 3; combined losses of the 
coupler pair used in the present work were 1.5 dB or less in either 
channel. Crosstalk protection was particularly important at the local 
station where the power at A; was much larger than at A2. The coupler 
provided approximately 30 to 35 dB of cross-talk protection, and a 
GaAlAs band-edge filter’ cemented to the app added an additional 17 
dB. No crosstalk effects were observed at either station. 

Graded-index, cabled lightguide of 50-um core diameter and 0.22 
numerical aperture was used throughout the final assembly. At either 
station the laser emission injected into the lightguide link was en- 
hanced by thermally forming® a glass lens on the end of an optical 
fiber pigtail. Substantially hemispherical, 125-um-diameter lenses in 
the refractive index range 1.9 to 2.1, made from low-melting-tempera- 
ture glasses of high lead-oxide content, refracted 70 per cent of the 
laser output-facet emission into the optical guided modes. The lensed 
pigtail was epoxied into position relative to the laser to deliver maxi- 
mum output power. 


* These lenses were of the SLW type supplied by Orient Glass, Inc. 
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Fig. 3—Bidirectional characteristic of GRIN-lens coupler. 


IV. PERFORMANCE 


The local-station laser bias current was adjusted to produce a 0.90- 
volt dc level at the remote station, a photovoltage at which the 
optically powered circuits of Fig. 1 drew currents in the range of 220 to 
280 pA. The optical power incident on the photovoltaic detector at 
wavelength A, = 0.84 ym was 0.65 mW, yielding a 1.0-volt open-circuit 
voltage, 330-1A short-circuit current, and a photovoltaic power con- 
version efficiency of 0.34 at the circuit operating point. (Substantially | 
larger efficiencies have been reported® for optimum loading at higher 
optical powers.) The cabled lightguide link was 0.5 km long. The loss 
at Ai was 4.7 dB, comprised of 1.5-dB laser-to-lightguide coupling loss, 
1.2-dB bidirectional-coupler-pair insertion loss, and 2.0-dB lightguide 
transmission loss. Hence 2.0 mW of local-station laser emission were 
needed to power the remote station optically. This power was increased 
to 3 mW for sound alerting. Alerting relied on charge storage during 
silent portions of the sound-alert format to lessen the laser power 
requirements compared to an earlier? method; acoustic power bursts 
of about 0.3-mW peak and 0.5-second duration were generated at 1.4 
kHz, the alerter resonant frequency. 

Television reception via the optical link was subjectively indistin- 
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guishable from that obtained by connecting the receivers via 75-ohm 
cable to the Tv-4 color-bar generator or to the video cassette Tv-3 
output. Speech reception of good commercial telephone quality was 
observed subjectively at either station under both electrically and 
optically powered conditions, and the transition between electrical and 
optical powering of the remote station occurred without significant 
alteration in audio quality. 


V. CONCLUSIONS 


The ability of double-heterostructure photovoltaic devices to detect 
high-frequency optical modulations, while providing efficient photo- 
voltaic generation of dc power, has been established. The feasibility of 
bidirectional speech-television communication over a lightguide link, 
with speech communication capability during utility power outages at 
the remote station, was demonstrated. 
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Equalization of Multimode Optical Fiber 
Systems 


By B. L. KASPER 


(Manuscript received December 31, 1981) 


Decision feedback equalization (DFE) may provide a simple way to 
dramatically increase the bit rate on multimode optical systems. 
Computer simulations of digital multimode systems with tapped 
delay-line linear equalization and/or DFE are described. Results 
indicate that with relatively simple hardware it may be possible to 
increase by many times the usable bit rate on dispersion-limited 
optical channels, with a power penalty less than that for multilevel 
transmission and without the latter’s complexity. For example, one 
could double the bit rate of a two-level system by paying an equali- 
zation power penalty of about 3 dB with DFE, whereas doubling the 
bit rate by going to four levels requires a penalty of 4.8 dB. 


1. INTRODUCTION 


It has been recognized for some time’ that conventional linear 
equalization is of limited benefit in improving the performance of 
multimode optical fiber channels. The major reason is the rapid falloff 
in the equivalent baseband frequency response of the optical channel, 
which results in considerable noise enhancement when linear equali- 
zation is attempted using a filter which emphasizes the high frequen- 
cies. 

However, for data rates above 100 Mb/s, multimode systems are 
generally modal dispersion rather than loss-limited* and some form of 
equalization is highly desirable. This is particularly true considering 
that index-grading imperfections in presently manufactured fibers 
produce a wide spread in fiber bandwidths and that equalization could 
improve the yield of usable fibers. 

Decision feedback equalization (DFE) is known to be superior to 
linear equalization for channels which exhibit amplitude distortion.** 
Previous theoretical investigations”® of DFE for fiber transmission 
systems have shown that power penalties are significantly lower than 
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those using linear equalization, with the improvement becoming 
greater as the amount of dispersion increases. 

This paper gives the results of computer simulation of fiber optic 
channels for cases having no equalization, linear equalization via 
transversal filtering, and DFE both with and without a preceding stage 
of transversal equalization. We show that for pulses with either Gaus- 
sian or cosine-squared dispersion, decision feedback alone does a 
remarkably good job. A single DFE tap is adequate over a wide range 
of fiber bandwidths. A discussion of the effect of timing phase on DFE 
performance is included. The use of a transversal filter with minimum 
mean-square error (mse) tap weights preceding the DFE stage is found 
to produce little improvement in performance. 


ll. SYSTEM MODEL 


To investigate the performance of various equalizers, the model 
shown in Fig. 1 was chosen. The input and output parameters of the 
model are shown and are explained below. 


2.1 Transmitter 


The input data sequence, {a,}, consists of elements which are 
assumed to be independent identically distributed discrete random 
variables. These discrete amplitudes modulate the power of the trans- 
mitted pulse p,(t) at a rate B = 1/T to produce the transmitted signal. 
Optical detectors produce an output current proportional to received 
power that is always positive; therefore, p;(t) which defines the pulse 
shape and the elements of {a,} which determine the pulse amplitudes 
must always be greater than or equal to zero. Input data sequence, 


No 


Prlt) Prit) fs s(t) 
O-{t) Or{t) G Hip (f) Vip {t) 
nine aoy 
Rane TT AMPLIFIER “FILTERE 
{@n} DISPERSIVE 
3 FIBER 
















Agit 
a? f(t) 





OFFSET C 


DECISION 
CIRCUIT 





DECISION 
FEEDBACK 


{3,°+* bay} 


Fig. 1—Block diagram of optical system model showing important parameters. 
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{an}, has an expected value a@ and a variance o2. For the binary case, 
Gn = 0 or 1, we have & = 0.5 and oz = 0.25. 
The transmitted optical signal is thus 


Ot) = y anp(t — nT), 


n=—o 


where O,(t) is in units of optical power. The average transmitted power 
is 


(O.(t)) = <| piAt)dt. 
The optical signal can be constrained by a limit in either average 
power or peak power. 


2.2 Channel 


The optical fiber has an impulse response /;(¢) such that the received 
optical signal power is 


O,(t) = i Or) hy(t — t)dr. 
The average received power will be 


(O,(é)) = i O,(t)dt 


= | i Olt) h(t — 7)dtdr 


-| oun) | h;(t — r)dtdr 


-| Oude | h;(t)dt 


= (O,(é)) -H;(0), 


where H;(0) = f{%..4(t)dt is the de or steady-state fiber loss. 
The shape of an isolated received pulse will be 


Pr(t) = pelt) *hy(t), 


where * denotes convolution, hence the received signal can be ex- 
pressed as 


O,(t) = y anp(t — nT). 


n=—00 
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2.3 Detector and amplifier 


The detector and amplifier together produce an output voltage va(t) 
proportional to the received optical power, plus added noise n/(t), 


va(t) = G[O,(t) + n(é)], 


where G is a gain constant. 

In general,’ the noise spectrum of receivers referred to the input can 
be characterized by a noise corner frequency, f,-, above which the 
noise power increases at 6 dB/octave. Hence 


N(f) = No[1 + (f/fne)"], 


where Np is the low-frequency noise spectral density. This characteri- 
zation is a good approximation for receivers employing either ava- 
lanche photodiode or p-i-n detectors, and either high impedance (in- 
tegrating) or transimpedance amplifiers. It is assumed that 1/f noise 
can be neglected for bit rates of 100 Mb/s or more. 


2.4 Low-pass filter 


The amplifier in Fig. 1 is followed by a low-pass filter Hz p(f) which 
attempts to maximize the s/n by eliminating high-frequency noise and 
by performing phase equalization on the received pulses. If linear 
equalization or DFE is employed following this filter, its optimum form‘ 
is known to be a whitening matched filter for the noise spectrum N(f) 
and received pulse shape p,(t). Such a filter produces perfect phase 
equalization but does not perform amplitude equalization, hence inter- 
symbol interference (1s1) will generally be present between pulses at 
its output, if the channel exhibits amplitude distortion. If additional 
equalization does not follow, then the low-pass filter should be used to 
reduce 1sI through amplitude, as well as phase equalization. The 
desired goal is often a raised-cosine pulse which has zero-crossings at 
all other sampling instants. However, amplitude equalization results 
in the enhancement of noise at frequencies which are “bumped up,” 
limiting the attainable s/n improvement. 

In a practical repeater, Hrp(f) will generally be some nonoptimum 
fixed filter. If subsequent equalization is included, the filter will be 
near-optimum as long as it passes frequencies where p,(t) has signif- 
icant energy and rejects noise at frequencies where p,(t) has little 
energy. Phase equalization, if necessary, should be done here or by a 
following transversal equalizer. If phase equalization is not attempted, 
Hip(f) should have a linear phase characteristic to prevent the addi- 
tion of phase distortion. 

In this paper, no attempt is made to optimize Hzp(f). A fixed filter 
is chosen with the optimization being left to later equalization stages. 
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2.5 Transversal equalizer 


The transversal equalizer is illustrated in Fig. 2. The output vr«(t) 
consists of the weighted sum of delayed versions of the input uzp(é). 
The weighting coefficients are {d;,} and the delay interval is T. Hence 


L 
Ure(t)= Y drpurp(t+ kT). 
k=—K 

A number of papers have been written about transversal equalizers, 
and their characteristics are well known. The design problem consists 
of choosing the number of leading and trailing taps L and K, and the 
weighting coefficients {d,}. If Hzp(f) is a whitening matched filter, 
then a synchronous equalizer with tap spacing, T, is known to be 
optimum.’ If Hzp(f) is suboptimum, then a fractionally spaced equal- 
izer® with more frequent taps can be used to advantage. The extra 
degrees of freedom provided by more taps are used to approach the 
matched filter condition. 

If decision feedback is used following the transversal equalizer, then 
it has been shown‘ that trailing taps which correspond to decision 
feedback taps can be eliminated. With enough DFE taps to cancel all 
pulse postcursors, only leading transversal taps are necessary. The 
coefficients {d;,} are commonly chosen to either minimize peak distor- 
tion (the zero-forcing equalizer) or mse.? 

The transversal equalizer can be viewed as an extension of the low- 
pass filter Hzp(f). To eliminate IsI, it must shape the pulse spectrum 
to that of a Nyquist pulse. Frequencies attenuated by the channel, 
usually the higher frequencies, must be enhanced with the consequent 
penalty of increased noise. The advantage of using a transversal 
structure to accomplish this filtering is ease of adjustability, especially 
when the equalizer must be adapted to an originally unknown or slowly 
time-varying channel characteristic. However, as mentioned in the 
introduction, linear equalization of this type is known to be of little 
benefit for optical channels because of exorbitant noise enhancement. 


vppiLT) v_p(O) 
viptt) } 7 Yip (-KT) 








£ LEADING TAPS K TRAILING TAPS 


Fig. 2—Block diagram of transversal equalizer stage. 
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2.6 Decision feedback 


The decision feedback stage is diagrammed in Fig. 3. A decision 
circuit or comparator produces estimates {G;} of the transmitted data. 
A weighted sum of past decisions is subtracted from the incoming 
signal, with the weights {,,.} being exactly equal to the amplitude of 
corresponding pulse postcursors at each sampling instant. Thus, if the 
decisions {d,} are correct, then 1s1 from already-detected pulses is 
completely removed with no enhancement whatsoever of the noise. 

An offset C is included to allow optimization of the decision thresh- 
old. 


ll, COMPUTER SIMULATIONS 


The objective of the computer simulations is to compare the power 
penalties of different equalizers for various amounts of fiber dispersion. 
Inputs to the model include the number of DFE taps M, the number of 
leading and trailing transversal filter taps L and K, the low-pass filter 
response Hzp(f), the receiver noise corner frequency fr, and the 
received pulse shape p,(¢). Outputs consist of the receiver low-fre- 
quency noise level No for an error probability of 10-°, the optimum 
transversal tap weights {d,} and DFE tap weights {b,}, and the 
equalized pulse shape r(Z). 


3.1 Error probability bounds 


The calculation of exact error probabilities in the presence of ISI 
plus noise is very complex, hence one generally resorts to various 
bounds. One common worst-case bound compares eye opening for the 
most adverse message sequence to the noise standard deviation. The 
criterion Q is defined as 


OFFSET C 







vre (t) DECISION 


CIRCUIT 


Fig. 3—Block diagram of decision feedback stage. 
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where 


ro = pulse height at sampling instant 


II 


Tn = pulse height (IsI) in other time slots 


S = set of time slots equalized by decision feedback 
o = noise standard deviation. 


This bound is relatively good if the IsI is large relative to the noise and 
is limited to only a few symbols, but it is pessimistic if IsI is small and 
extended over time, as the worst-case bit sequence will then occur only 
rarely. For this case, a better bound is obtained by approximating ISI 
by anormal distribution and adding its variance to the noise variance, 
giving the usual mse bound.*” 

Tighter bounds than either of these can be found at the expense of 
various degrees of computational complexity.’ 

The impulse response of optical fibers is typically found to resemble 
a Gaussian or a cosine-squared pulse."’ Therefore, the Ist is limited to 
only a few adjacent symbols. In this situation, the worst-case eye 
opening or @ bound is a reasonably good choice. 


3.2 Offset and decision feedback optimization 


The optimum mse DFE for fiber optic systems has been analyzed by 
Messerschmitt.” Let the shape of an isolated pulse at the transversal 
filter output be denoted by r(z): 


r(t) = GL p-(2)*hip(t)*hrr(d)], 
where 
G = amplifier gain 
hyip(t) = low-pass filter impulse response 
hrr(t) = transversal filter impulse response 
= 5 d,d(t + kT). 
k=—K 
The decision threshold can be optimized by an offset C given by’ 


C = No0GHip(0)Hrr(0) + a y ™m™—- bm — | 


n=—o0 mes 
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where Ao is the detector dark current and S is the set of DFE taps 1, 
--., M. Also, the decision feedback tap weights are optimized by 
setting 


bm = Tm, MES. 


The values r,, are sampled versions of r(t), with r, corresponding to 
the instant at which the pulse is sampled for detection. With the above 
choice of DFE tap weights, we have 


C = \oGH p(0)Arr(0) + a Yn 1 ; 
n¢S 


With the above choices of C and {b,,}, and assuming correct decisions, 
the mse is shown to be® 


mse = E[(uz — dz)”] 
- tl 5 r= Ore +1| +o’, 
n¢S 
where 
o” = noise variance 
= E[[Gn(t)*hip(t)*hre(t)}]. 


As before, n(t) is the receiver noise (thermal plus shot noise) referred 
to the receiver input. Note that only terms r,, n € S (i.e., no corre- 
sponding DFE taps) contribute to mse. 


3.3 Transversal filter optimization 


The problem of determining optimum transversal filter tap weights 
{dx} when decision feedback is used was first solved by Austin.” In 
matrix form, the solution can be expressed as 


D=A‘'a, 
where 
D = vector of tap weights 
=> [d_x, d_K+1, See 9 d,]" 
1 
A=y+ >5¢ 
Oa 


@ = noise covariance matrix of terms dzm 


drm = Pk-m = | N(f)Hip( f)e re”? df 


—o 
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w = modified signal autocorrelation matrix of terms Wim 


Wim a Sn+kSn+m 
ngS 


a= [s_x, S—K+1,; bene: sz]7 


S, = sample of isolated pulse at low-pass filter output 


= s(kT) 
s(t) = G[p-(t)*hrp(t)]. 
The matrix 


1 
A= + 4¢ 
On 


is symmetric and positive semidefinite, hence it is invertible, except in 
extreme cases unlikely to be encountered in practice. The above 
solution for the tap weight vector D minimizes mse. This criterion may 
not produce the best bound on the probability of error, but is used 
here because it is the only bound for which optimum tap weights are 
known to have been formulated. 


3.4 Power penalty calculation 


Equalization power penalties are computed by assuming that the 
received pulse p,(t) has unit area (i.e., unit energy) and solving for the 
receiver noise level No which will produce an error probability of 107°. 
Because the transversal filter coefficients D are dependent upon the 
noise covariance matrix @, which is itself proportional to No, an 
iterative solution is necessary. The simulation program allows the 
operator to enter an initial value for No, and Newton-Raphson iteration 
is then used to converge to a final solution for both No and D. 

An error probability of less than 10° is assured by making Q equal 
to 5.99781. The eye opening £ in the presence of decision feedback is 
calculated as 


E=ro— ¥ |ral, 
néS 
n¥0 


while the noise standard deviation is 
Co = [NoD7oD]!”. 


Iteration proceeds by assuming a value for No and calculating the 
corresponding tap weights D. The eye opening and o, plus their 
derivatives with respect to No, can then be found, allowing new values 
of No to be calculated until the ratio Q = 0.5E/o converges to 5.99781. 
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3.5 Timing phase optimization 


When decision feedback with no transversal equalizer was analyzed, 
or when only leading or anticausal transversal taps were specified, the 
power penalties were found to depend heavily upon timing phase. 
Optimization was carried out by simply advancing the sampling time 
(i.e., delaying the pulse) and finding the delay which resulted in the 
lowest power penalty. 


IV. RESULTS 
4.1 Gaussian dispersion and equalizing filter 


Fibers with perfect mode mixing exhibit Gaussian dispersion and 
have an impulse response given by 
hy (t) = tte, 
27a 
The baseband frequency response is 
H;(f) _ el@raT/?/2) 
and the 6-dB electrical bandwidth is 


1 
Ioan = 5p V2 In 2 


= 0.1874/aT. 


Figures 4, 5, and 6 show power penalties for rectangular nonreturn 
to zero (NRZ) pulses transmitted over such a fiber. An NRz pulse is 
defined by 


T 
2 


0, elsewhere 


1 <ixi 
mae 


x(t) = 


X(f) = a sin af T. 
af 


The low-pass filter Hzp( f) is chosen to produce a Nyquist output pulse 
with a raised cosine characteristic’ and an excess bandwidth of 8 = 0.5 
for an NRZ input pulse. The output pulse spectrum is 
B 
1,0=/fl=5 0-8) 


1 ‘ 1\; B B 
Y(f) = s[1-sing (£-3) |. Fa-m<ifl< 50+ 


0, elsewhere , 
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hence the required low-pass filter characteristic is 
Y( B 
a lf<Za+e 
Hy p(f) = 
0, elsewhere. 


For zero fiber dispersion, this low-pass filter produces a Nyquist output 
pulse with no IsI. The 1s! will be produced by fiber dispersion and will 
increase as the fiber bandwidth decreases. 

Power penalties in the following results are calculated relative to 
ideal matched-filter detection of an isolated NRZ pulse with zero 
dispersion. 


4.1.1 White noise (f,. > B) 


An actual optical receiver will not generally have a white noise 
spectrum. With a high-gain APD, for example, the noise is proportional 
to the received signal and so is time dependent. However, as a worst- 
case situation, one may assume stationary white noise with a level 
equal to that for the most adverse message sequence. 

Figure 4 shows power penalties for a white receiver noise spectrum. 
The four curves are for A—no additional equalization; B—11-tap 
transversal equalizer; C—five DFE taps plus 11-tap transversal equal- 
izer; and D—five taps of decision feedback with optimized timing 


A — NO EQUALIZER 
B — 11 TRANSVERSAL TAPS 
C — 5 DFE TAPS PLUS 
11 TRANSVERSAL TAPS 
D — 5 DFE TAPS ONLY PLUS 
OPTIMUM TIMING 


O- N-LEVEL 





RATIO OF BIT RATE TO 6-DECIBEL BANDWIDTH, & 


| Fig. 4—Power penalties versus ratio of bit rate to fiber bandwidth for Gaussian 
dispersion, equalizing low-pass filter, and white noise. 
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phase but no transversal equalizer. Also shown are power penalties for 
multilevel signaling which are discussed in Section 4.5. 
The horizontal axis values are in terms of a variable R given by 


bit rate B 
6-dB electrical bandwidth (3-dB optical)’ 


As the 6-dB channel bandwidth becomes less than the Nyquist fre- 

quency (i.e., R > 2), DFE plus linear equalization (curve C) begins to 
perform much better than linear equalization alone (curve B). Most 
notable, however, is that DFE only (curve D) does nearly as well as 
DFE plus linear, and beyond R = 4.5 remains within 0.2 dB. The 
addition of a transversal equalizer stage prior to DFE produces almost 
no improvement. The reason for this is that the transversal tap weights 
have been chosen to minimize mse, whereas the criterion of goodness 
is the ratio of eye opening to noise standard deviation. The problem is 
that almost all of the IsI is due to one large immediate precursor. The 
mse algorithm seeks to minimize the sum of the square of this precursor 
and the noise variance. To reduce the ISI, it must allow the noise to 
increase, which it does to such a degree that according to the eye 


oe at y Ia¢41n Ana ha aa 
opening critoricn ve ery altuc nas oéen gained. 


Prior to adoption of the eye opening criterion, an attempt was made 
to use mse as the power penalty measure. However, this criterion 
proved to be extremely pessimistic in the absence of a transversal 
equalizer. The mse is dominated by a few large Ist values and indicated 
very high-power penalties. However, the eye opening criterion showed 
the actual penalties to be much smaller. 


R= 


4.1.2 Colored noise (f,. = B/3) 


The curves in Fig. 5 were obtained assuming a receiver noise 
spectrum which increases with frequency. The noise corner frequency 
is chosen as f,- = B/3, and is based upon parameters of a typical 
p-i-n/FET receiver operating at 100 Mb/s as described in the Ap- 
pendix. 

In comparing Figs. 5 and 4, note the difference of 1.12 dB in the 0- 
dB reference levels (i.e., matched-filter detection is 1.12 dB worse for 
colored noise than for white noise). Disregarding this baseline shift, 
curves A and D in both figures are identical, as should be expected, 
because with no transversal equalizer the error rate depends only on 
the noise variance and not on the shape of the noise spectrum. The 
addition of a transversal equalizer does not help as much as in the 
white noise case. The reason is that a linear equalizer works by 
enhancing high frequencies which now contain more noise than they 
did for white noise. 

Comparing curves C and D, we see that the addition of a transversal 
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A —- NO EQUALIZER 
B — 11 TRANSVERSAL TAPS 
C — 5 DFE TAPS PLUS 
11 TRANSVERSAL TAPS 
D ~ 5 DFE TAPS ONLY PLUS 
OPTIMUM TIMING 


© - N-LEVEL 


POWER PENALTY IN OPTICAL DECIBELS 





RATIO OF BIT RATE TO 6-DECIBEL BANDWIDTH, A 


Fig. 5—Power penalties for Gaussian dispersion, equalizing low-pass filter and colored 
noise with f,, = B/3. Receiver noise spectrum corresponds to a p-i-n FET receiver as 
described in the Appendix. 


equalizer improves the power penalty by about 1 dB over DFE alone. 
As DFE removes much of the Ist, the transversal stage is able to 
decrease the mse by acting to reduce high-frequency noise. It can be 
expected that the choice of a fixed low-pass filter Hip( f) with a lower 
3-dB frequency than B/2 would produce similar power penalty im- 
provements for DFE only. A narrower Hrp(f) would not affect IsI 
significantly if the channel bandwidth is already less than the low-pass 
filter bandwidth. 

For cases where f,- is lower than B/3 as assumed here, the choice of 
a minimum Hrp(f) bandwidth becomes more crucial because the 
amount of high-frequency noise increases. Decision feedback equali- 
zation is very important in such cases as it provides a way of noiselessly 
eliminating the Is1 introduced by a narrow low-pass filter. 


4.1.3 One decision feedback tap 


The curves in Fig. 6 are for white noise with five DFE taps plus 
optimum timing phase and one DFE tap plus optimum timing phase. 
Up to a bit rate of 3.5 times the 6-dB bandwidth, the curves remain 
within 0.4 dB of one another. Hence, a single DFE tap with no 
transversal equalizer, which in hardware would require one flip-flop, a 
summer and a weighting network, does a remarkably good job of 


MULTIMODE OPTICAL SYSTEMS 1379 


A — 5 DFE TAPS ONLY PLUS 
OPTIMUM TIMING 


B — 1 DFE TAP ONLY PLUS 


OPTIMUM TIMING 


POWER PENALTY IN OPTICAL DECIBELS 





RATIO OF BIT RATE TO 6-DECIBEL BANDWIDTH, R 


Fig. 6-—Power penalties for decision feedback alone. Gaussian dispersion, equalizing 
low-pass filter and white noise. 


equalization for Gaussian dispersion. The reason for this success is 
that up to R = 3.5, almost all of the Isi is due to one large postcursor 
(provided sampling is done before the pulse peak), and one DFE tap 
can remove this IsI completely. 


4.2 Gaussian dispersion and Bessel filter 


The raised cosine low-pass filter used in the previous cases could be 
difficult to design in practice. Therefore, it is of interest to see what 
happens with an ordinary realizable filter, in this case a 5th order 
maximally linear phase or Bessel filter, with a 3-dB bandwidth of B/2. 

The exact low-pass filter response is 


mn (fj sam (9) 


ai f\" f\' 
+j—| 4.855 — 12.70 {—] +2. = 
IZ 12.70 3B 2.859 3B 
Results for white receiver noise and Gaussian dispersion are shown in 


Fig. 7. 
Compared to Fig. 4, the power penalty for no equalization is higher 


AtP(f) = 
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B — 11 TRANSVERSAL TAPS 
C — 5 DFE TAPS PLUS 

11 TRANSVERSAL TAPS 


D — 5 DFE TAPS ONLY PLUS 
OPTIMUM TIMING 


O- N-LEVEL 
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RATIO OF BIT RATE TO 6-DECIBEL BANDWIDTH, & 


Fig. 7—Power penalties for Gaussian dispersion, Bessel low-pass filter and white 
noise. The low-pass filter is a 5th order maximally linear phase Bessel filter with a 3-dB 
bandwidth of B/2. 


after R = 2.5 as expected because the Bessel filter does a poorer job of 
equalization than the previous low-pass filter. The linear equalization 
curves are identical because the transversal filter can compensate for 
a poorer choice of Hz p(f). Both decision feedback curves show penal- 
ties less than 0.3 dB greater than those in Fig. 4, which would indicate 
that the choice of low-pass filter is not too critical. 


4.3 Cosine-squared dispersion and equalizing filter 


Measurements of actual fibers indicate that a cosine-squared pulse 
shape is often more representative of the actual impulse response than 
a Gaussian.’ Power penalties for NRz pulses transmitted over a fiber 
with cosine-squared dispersion are shown in Fig. 8. The receiver noise 
spectrum is white and the low-pass filter produces 8 = 0.5 raised cosine 
equalization for an NRZ pulse with zero dispersion. 

Compared to Fig. 4, we see that linear equalization is much less 
effective for cosine-squared dispersion than for Gaussian dispersion. 
The reason for this is found by considering the channel’s bandpass 
characteristic. If its impulse response h(t) is given by 


t 
2/7 cos” a —7/2<st<7/2 
A,(t) = 7 


0, elsewhere, 
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RATIO OF BIT RATE TO 6-DECIBEL BANDWIDTH, A& 


Fig. 8—Power penalties for cosine-squared dispersion, equalizing filter, and white 
noise. 


then the baseband frequency response is 


Ap = 22h | i i 


aft 1 — f?r? 


The channel will have nulls wherever sin 7fr = 0 (except at f = 0 and 
f = 1/7). As is well known, a linear equalizer would need infinite gain 
to correct for such nulls, resulting in infinite noise enhancement. 

The DFE alone is able to perform equally as well for cosine-squared 
dispersion as for Gaussian. However, DFE plus linear equalization is 
somewhat worse than DFE alone above R = 4.0. The choice of minimum 
mse tap weights appears to be particularly bad for cosine-squared 
dispersion. 


4.4 Dependence on timing offset 


To increase the effectiveness of decision feedback in eliminating ISI, 
one should arrange to have as much of the IsI as possible come from 
pulses which have already been received (postcursors) and for as little 
as possible to come from future pulses (precursors). The simplest way 
to do this is to advance the sampling instant such that decisions are 
made on the leading edge of the pulse rather than on the center of the 
pulse. Because optical pulses are generally quite limited in duration 
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5 LEADING TRANSVERSAL TAPS 


POWER PENALTY IN OPTICAL DECIBELS 





0 0.2 0.4 0.6 0.8 1.0 1.2 1.4 
TIMING OFFSET IN UNITS OF T 


Fig. 9—Power penalties versus sampling time advance for cosine-squared dispersion. 
Ratio of bit rate to fiber 6-dB bandwidth is R = 5.0, Equalizing low-pass filter and white 
noise are assumed. 


and do not have extensive post- and precursors, this simple technique 
combined with DFE may offer a substantial improvement in perfor- 
mance. 

Figs. 9 and 10 show calculated power penalties versus timing offset 
for cosine-squared and Gaussian dispersion when R = 5.0. The receiver 
noise is white, and an equalizing low-pass filter is assumed. The solid 
curves are for five DFE taps only, whereas the dashed curves are for 
five DFE taps plus five leading transversal taps. 

For DFE only, advancing the sampling time by 0.6 of a bit interval 
improves the power penalty by 4.0 and 2.6 dB in these two cases. 
Improvements are also produced for DFE plus linear equalization, with 
the optimum timing advance being greater than one bit interval. 

To illustrate the effect of offset sampling, consider the example in 
Fig. 11. The pulses are produced by convolving a rectangle of width T 
with a Gaussian having a standard deviation of 0.937 T, equivalent to 
a channel bandwidth of B/5. With DFE, only precursor ISI is important. 
The eye openings for (a) a centrally sampled pulse and (b) a pulse- 
sampled 0.5 T earlier will be 


(a) Emre ¥ |r 
n#¥0 
= 1.0 — 0.58 — 0.10 = 0.32 
(b) E = 0.87 — 0.29 — 0.02 = 0.56. 
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Fig. 10—Power penalty versus sampling time advance for Gaussian dispersion. 
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Fig. 11—Pulse waveforms with Gaussian dispersion to illustrate the advantages of 
sampling prior to the center of the pulse. (a) Central sampling. (b) Sampling advanced 
by 0.5 T. 
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The larger eye opening with advanced sampling produces a 2.4 dB- 
optical power penalty improvement, as o is the same in both cases. 

One disadvantage of advancing the sampling time is that it will 
increase the likelihood of DFE error propagation. This likelihood is 
dependent upon the DFE tap weights, and will increase with timing 
advance because the postcursors become both larger in amplitude and 
more extended in duration. Therefore, the improvement in power 
penalty will not be as large in practice as indicated above. However, 
the gains can be expected to be much greater than the losses, because 
doubling the size of the eye opening through DFE can reduce the 
probability of error by perhaps four orders of magnitude, whereas the 
increase in the error rate because of error propagation cannot be more 
than 2” where M is the number of DFE taps.” 

A most interesting point is that theoretical derivations of the opti- 
mum transversal filter for use with decision feedback have always 
concluded that a one-sided transversal filter is optimal and that 
no taps following the peak of a matched-filter pulse are necessary.**" 
This implies a timing phase such that sampling coincides with the 
peak of the pulse. However, the simulation results herein show that 
advanced sampling leads to better performance. One way to obtain the 
necessary optimal timing phase is to have a few trailing transversal 
taps which can delay the pulse relative to the sampling instant. Hence, 
11 transversal taps with five leading and five trailing were found to 
perform much better than five leading taps only. 


4.5 Comparison of DFE and multilevel signaling 


The use of DFE more closely approximates the use of partial-re- 
sponse” or multilevel signaling than the use of traditional linear equal- 
ization. Binary signals transmitted at a rate greater than twice the 
optical-channel, 6-dB bandwidth will always contain IsI when received. 
The received level, however, is a predictable function of the transmit- 
ted message sequence. Decision feedback provides a simple way of 
decoding the multiple received levels back into the transmitted binary 
digits. 

It is useful to compare the previous DFE power penalties to those 
which one could expect for multilevel signaling. Consider the case of 
Gaussian dispersion and white receiver noise. For two-level transmis- 
sion, one might operate at a symbol rate equal to twice the channel’s 
6-dB bandwidth (i.e., R = 2). Very good linear equalization would be 
needed to prevent ISI, hence from curve B of Fig. 4 the power penalty 
for two-level transmission would be 1.8 dB. For N-level transmission, 
there is an additional power penalty of 10 logio(N — 1) dB, with a 
factor of logsN increase in R. Total power penalties for multilevel 
transmission for the various combinations of receiver noise spectra, 
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low-pass filters, and fiber impulse responses considered previously are 
included in Figs. 4, 5, 7, and 8. In all cases, both DFE plus transversal 
equalization and DFE only show better performance than multilevel 
signaling at the same bit rate. For example, by using DFE doubling R 
from 2 to 4 results in a penalty of about 3 dB in all cases, whereas 
doubling R by using 4-level signaling results in a penalty of 4.8 dB. For 
tripling to R = 6, DFE is consistently better than 8-level signaling by at 
least 2 dB. 

Hence, DFE may offer a way of obtaining the performance of multi- 
level optical transmission without many of the latter’s disadvantages. 
For instance, only two-level (on-off) transmitter modulation is needed. 
Also, complex linear equalization to prevent ISI is unnecessary. The. 
hardware to implement DFE is extremely simple. 


V. CONCLUSIONS 


The main conclusion is that DFE offers substantial benefits for 
multimode optical fiber systems. Channels which are presently disper- 
sion-limited could potentially be operated at many times the existing 
bit rate with a power penalty less than that for multilevel signaling. 
The hardware required to implement the necessary equalization can 
be extremely simple as transversal filters are not a necessary compo- 
nent and no special filtering need be included. In addition, the char- 
acteristics of an optical channel are very stable, making circuitry for 
continuous adaptability unnecessary. 

A good deal of theoretical work remains. For transversal equalizers, 
a new technique of selecting tap weights besides the traditional zero- 
forcing or minimum mse algorithms must be developed. For decision 
feedback, further investigation of error propagation is needed. Atten- 
tion should be given to optimizing the sampling time and to its effects 
on the probability of error propagation. Sensitivity to errors in the 
. sampling time should also be considered. 

Experimental confirmation of the simulation results described above 
is also very desirable, and work along these lines is now proceeding. 


APPENDIX 
P-i-n FET Receiver Parameters and Noise Corner Frequency 


According to Smith and Personick,’ the noise corner frequency is 
given by 


“2 1/2 
re 1 E. (@) ea/af 4 


~ QaCr| d(ex(w))/df Ra 
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For a p-i-n FET receiver, we have 
d 
df 


(i7(w) )eq = equivalent input shunt 
current noise source 
_ 4kT 


L 





a 2qI gate + 2qlaark 


d . ; : 
—.(e2(w)() = equivalent input series 


df 
voltage noise source 
4kTT 
ma 
Some typical parameter values for a good p-i-n FET receiver at a bit 
rate of B = 100 Mb/s are 





Rin = Ry = 1 MQ 


Cr=1 pF 
£m = 50 ms 
I‘ = 1.75 
Tate = 10 nA 
Taare = 10 nA. 


With the above values, one obtains a noise corner frequency of fie = 
31.7 MHz = B/3. 
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Frequency-Hopped Single-Sideband 
Modulation for Mobile Radio 


By V. K. PRABHU and R. STEELE 
(Manuscript received April 4, 1981) 


A frequency-hopped single-sideband (ssB) modulation with pilot 
tone (FH-SSB-PT) transmission scheme ts described for mobile radio. 
The single-sideband signal changes its carrier frequency every T 
seconds, and the sequence of carrier frequencies is controlled by a 
suitable scrambling code. Co-channel cells using the same frequency 
band in the mobile radio system use different scrambling codes with 
the result that co-channel interference is altered from cross-talk to 
random noise if t is small. In the presence of Rayleigh fading, 
frequency hopping improves reception for stationary vehicles, and 
does not degrade the performance for moving vehicles. Further, the 
technique of frequency hopping may offer communications privacy. 
Expressions for s/n are derived for SSB, SSB plus pilot tone, and 
FH-SSB-PT in the presence of Rayleigh fading. Co-channel interference 
effects are also considered. 


1. INTRODUCTION 


Mobile radio telephone systems are required to operate in a hostile 
environment compared to radio systems using terrestrial and satellite 
links. The transmission channel is time varying in a manner dependent 
on the vehicle speed and location, distribution of buildings and terrain, 
siting of fixed antennas relative to the mobile, the effect of other 
mobiles, to mention but a few. Paramount of the design objectives is 
a high user density with good uninterrupted communication at a 
reasonable cost. To achieve these goals, a number of system concepts 
have been advocated,” and a developmental system to provide Ad- 
vanced Mobile Phone Service (Amps) is currently being operated by 
Illinois Bell Telephone Company in the Chicago Area.° 

Unlike conventional telephony where extra demand on communi- 
cation capacity can be accommodated by additional links, the mobile 
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radio spectrum is limited. To increase capacity, the area must be 
divided into cells and the RF band must be reused. Although mobiles 
in adjacent cells can be arranged not to interfere, co-channel interfer- 
ence caused by frequency reuse must be controlled in these systems. 
Thus, the mobile radio engineer increases capacity, not by using extra 
frequency assignments but by re-using the spectrum and taking care 
to reduce co-channel interference by adequate spacing between cells 
using the same frequencies. 

For urban mobile radio, the hexagonal cell structure giving complete 
coverage of an area has been extensively studied and is the one 
currently in vogue. In a cell there is either an omnidirectional antenna 
at the center of the cell site (start-up mode), or three directional 
antennas at alternate corners (fully operational mode). Assuming that 
mobiles in a cell do not interfere with others in the same cell, the main 
source of signal impairment results from fading of the received signal 
because of multipath effects present in an urban environment. The 
fading is statistically described by the Rayleigh distribution,’ and is 
not instantaneously correlated across the frequency band of interest— 
an effect known as selective fading. Some other sources of signal 
degradation, in addition to co-channel interference, are ignition and 
receiver noise. 

The choice of type of modulation is very important if we are to 
attain our design objectives. Various proposals have been made, and 
can be separated into two basic groups, those advocating digital 
modulation methods, and others arguing for the relatively well-known 
analog techniques.*” In this paper, we specifically consider the use of 
single-sideband modulation (sss) for mobile radio telephony. We do 
this because ssB does not result in any expansion of the occupied 
bandwidth, merely in its linear frequency translation. Single-sideband 
modulation has been exhaustively investigated at HF, and some pro- 
posals®* have been made for its introduction in mobile radio. However, 
no complete work appears to have been done in connection with its 
use in cellular structures. The AMPs system opted for frequency mod- 
ulation (FM) rather than ssB, exploiting the ability of Fm to capture 
the wanted signal in the presence of an interfering signal, provided the 
latter is below a certain threshold level. There is no capture effect 
inherent in ssp, and for a given annoyance in cross-talk the interfering 
signal level must be significantly reduced compared to FM. With its 
nonconstant envelope, SSB is also more vulnerable to fading than FM. 
However, FM requires significantly more bandwidth than ssp, and it is 
this fact that has prompted us to have a closer look at ssB to see if we 
can mitigate its inherent weaknesses. 

We describe quantitatively how well ssB performs when subjected 
to frequency selective fading and co-channel interference in a mobile 
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radio environment. The effect of using pilot carriers to combat fading 
is considered in detail. 

We then propose a new SSB system which we have called frequency- 
hopped SSB (FH-SSB). We attempt to ensure that a particular channel 
is never continuously in a fade, and that two co-channel signals (i.e., 
two signals occupying the same band in a single duration +r) being 
scrambled by different codes will produce an effect on each other 
equivalent to that of additive noise. This technique compensates for 
some deficiencies in ssp, and, being a scrambling method, may offer 
communication privacy.* The effect of applying space diversity recep- 
tion to frequency-hopped ssB with pilot tone is then considered. We 
conclude with a discussion that contains a summary and suggestions 
for further research. 


ll. PERFORMANCE OF FREQUENCY-HOPPED MODULATION IN MOBILE 
RADIO ENVIRONMENT 


To mitigate the effects of multipath and co-channel interference in 
the mobile radio environment, we propose a frequency-hopping system 
that ensures that adjacent voice signal segments are in noncontiguous 
RF bands after modulation. The basic scheme of frequency hopping 
can be applied to any modulation as illustrated in this section. Its 
operation and performance using SSB are treated in succeeding sections 
of this paper. 

Consider the transmission of a set of N single-channel voice signals 
over an RF bandwidth of B Hz. The rF bandwidth occupancy B, of 
each voice channel is, therefore, 


B 
B. — 1 
NV (1) 
Let the complex envelope of the kth speech signal s,(¢), be repre- 


sented by m,z(t), namely, 





m,(t) = sp(t) + 7&(t), (2) 
where §; is the Hilbert transform of s;(t), 
1 co 
s.(t) =~ i ee es (3) 
wy. bx 


The speech signals are band-limited to W Hz and divided into a set of 
nonoverlapping intervals of t seconds. Parameter 7 is determined by 
switching transients, vehicular speeds and available technology, and 


* It may be argued that to offer communication privacy, 7 has to be small compared 
to typical syllabic durations and scrambling codes should be changed frequently. 
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may therefore differ considerably depending on whether the environ- 
ment is urban or rural. The kth speech signal is represented by 





m,(t) = m,(t) y rect( — “) 
I=—00 





l=—oo 


= y ma(txect(—). (4) 


In eq. (4), rect(-), the rectangular window function, is defined by 


1 -—kh<x=h 


rect(x) = lo otherwise. (5) 


Suppose that in any time slot (J — 1)r < t S I7, a segment of the 
speech signal m,(t) is modulated on a carrier to occupy an RF band- 
width of B. and that this frequency band is directed to a subband p, 
1s p<=N. The subband p changes its location in the spectrum every 
7 seconds according to a scrambling code.”® 

The kth modulated voice channel can, therefore, be represented as 





l=—o T 


yt) = y F{m(t), fall, w)jreci(* ~ “|, (6) 


where F'{-} denotes the modulator output, /f;(/, w) represents the 
carrier frequency w during the /th time slot chosen for the kth speech 
channel by the scrambling code. 

Because the scrambling code is known at the receiver, we can detect 
m,(t), (2-1)r<tslt. 

Assuming the modulated rF bandwidth B, is much smaller than the 
“coherence bandwidth’”’ of the multipath interference, we denote the 
kth received RF signal as 





ix —/ 
z(t) = »Y ri(t)exp[jor(t) | F (mn. (t), Feld, w)}reet(* s "), (7) 


=—oo 


where 7;(t)exp[¢:(t)] is the complex envelope of the Rayleigh fading 
imposed by the transmission medium. This is the only type of multi- 
path interference considered here; we justify this on the grounds that 
it is usually the major source of degradation in mobile radio systems.’ 


iil. FREQUENCY-HOPPED SSB IN MOBILE RADIO ENVIRONMENT 


We now consider ssB in a frequency-hopping system. The kth 
modulated voice channel can be represented as 


1392 THE BELL SYSTEM TECHNICAL JOURNAL, SEPTEMBER 1982 


fo) 


u(t) = {3 sz(t)cosl F(Z, etree) 


=—oo 








~ 5 §,(t)sin[ f.(Z, ontreei(* — “| » (8) 


where s,(t) is the speech signal in the kth channel. Further, the 
complex envelope of the kth modulated voice channel in the /th time 
slot is 


ur(t) = mz (t)exp[7fr(l, w)t]. (9) 


Note that the carrier frequency changes every 7 seconds according to 
a scrambling code and f;,(/, w) represents the carrier frequency for the 
kth channel during the /th time slot. Since the RF bandwidth occu- 
pancy B, of each voice channel in ssB is yW, where y is slightly higher 
than unity, (typically 1.1), the total number WN of radio channels that 
can be accommodated in the RF bandwidth B is 
B 

N= SW" (10) 
As in Section II, we assume that the scrambling code is such that 
successive modulated voice signal elements are in noncontiguous RF 
bands, no two voice channels are in the same band, and all channels 
are accommodated in a given band B. A typical time-frequency char- 
acteristic of the kth voice channel is shown in Fig. 1. 
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Fig. 1—Frequency hopping of &th and /th channels during successive time slots. 
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Our intention is to demonstrate that by frequency hopping the 
speech channels as described, the performance of ssB will not be 
degraded and its co-channel interference characteristics will be im- 
proved. As a prelude, we calculate the s/n of single channel ssB signal, 
that is not accompanied by a pilot tone, and is subject to Rayleigh 
fading. We show that its performance is too low to be of any practical 
value. The s/n of ssB with pilot carrier is next examined and a 
significant improvement observed. We further show that frequency 
hopping does not degrade the performance of SsB with pilot carrier. 


3.1 Effects of Rayleigh fading on a SSB signal 


Consider a ssB signal subject to rapid Rayleigh fading of the type 
found in mobile radio systems. No pilot carrier transmission or fre- 
quency hopping is used. Assuming that the modulation bandwidth B, 
is much smaller than the coherence bandwidth (typically 250 kHz in 
an urban environment), the fading of the ssB signal is frequency flat 
over the signal band, and the envelope of recovered speech signal 
impaired by fading is 


u(t) = r(t)s(t). (11) 


As r(t) is Rayleigh distributed, the probability density function (pdf) 
of r is given by 


eG eee 

es 20° 

PAX) = F awe, (12) 
0, otherwise 


where the parameter o” is a function of the transmission medium. 
We now proceed to determine the s/n of (11). In Ref. 9, s/n is 
defined as 
(s *(t ))e 


= eee (13) 
8/0 = Tat) = sO) brs 
where (-), represents the average with respect to r and 
T 
1 
2 4 lim — *(t)dt. 14 
(x (ora tim Je | a9 t (14) 
However, Ref. 1 uses 
2 
an (((u(t))r)*)¢ (15) 


~ (u(t) — (u(t)? re 


The cross-correlation of the noise and signal as defined by eq. (138) is 
nonzero, although the fading and signal are statistically independent. 
In (15), the signal is defined as 


a(t) = (u(t), (16) 
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and the noise as 
n(t) = w(t) — (p(t))r, (17) 


yielding the desired result that the signal and noise are uncorrelated. 
Consequently, we opt for the definition of s/n given by eq. (15). 
From (11) and (16) the recovered signal is 


a(t) = (u(t), = (r(t)),s(t) (18) 

and 
n(t) = [r(t) — (r(é)) |s(é), (19) 
(n(t)), = 0 (20) 

and 
(n*(t)), = (s7(t)) {(r7(t)), — (r(t))F}. (21) 

With the aid of eq. (12), 

(r?(t)), = 207 (22) 

and 
(r(0))? = 0°, (23) 


resulting in a noise power of 
(n*(t)), = (s*(t)) (2 — 27/2)o°. (24) 


The s/n becomes 
4 
s/n = i/ (4 - 1) = 3.66 or 5.63 dB. (25) 


We note that this s/n of 5.63 dB is too low to be viable in a mobile 
radio environment, unless we mitigate the effects of fading. 


3.2 Effects of Rayleigh fading on a SSB signal with pilot carrier 
(SSB + PT) 

The first step to reduce the distortion in the recovered speech, 
caused by fading of the ssB signal, is to add a pilot tone p(¢) of known 
magnitude and frequency f,."? The spectral arrangement is shown in 
Fig. 2. The frequency f, is selected to reside outside the bandwidth B, 
of the ssB signal, but sufficiently close for it to experience similar 
fading conditions. The receiver, on examining the magnitude of the 
pilot tone, can estimate the degree of fading of the ssB signal, make 
appropriate compensation, and thereby improve s/n. 

The received SSB signal is not the desired m(t), but r(t)m(t), where 
r(t) is Rayleigh-distributed. The pilot tone is used to correct the 
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distorted demodulated signal to a value 
r(t)m(t) 


p(t) = a 


where ¢ and ¢, are the phase angles of the complex fading signals 
r(t)exp[7$(¢)] and r,(t)exp[J¢p(t) ], respectively. Clearly the pdf of 
r(t) 
rp(t) 
is of importance in determining how closely ji(¢) approaches the 


original complex envelope of the speech signal m(¢). It has been shown’ 
that the pdf of A(t) is 


exp{y[$(t) — $p(d)]}, (26) 





A(t) = (27) 


2x(1 — A?)(1 + x?) 


Pa(x) = [a +22 — 4x (28) 
where A, the correlation between r(t) and 7,(t), is 
nN : (29) 


~ TL + (QrfaT,)2 


In eq. (29), T, is a measure of the time-delay spread caused by 
multipath effects in the transmission medium. Parameter fy is the 
frequency separation between the pilot tone f, and a frequency f in 
the ssB signal. Hence, it follows that there is a minimum and a 
maximum value of f, for a given pilot tone f, (see Fig. 2). 

As the noise signal is 


n(t) = fi(t) — (flt))a, (30) 
where 
(ji(t))a = (A(t))m(t)exp{j[ o(t) — dp(t) ]} (31) 


and has infinite variance, unless \ = 1, it is possible for (ji(t))4 to 
swamp the average output signal as 7,(t) can be zero. This situation is 
similar to that for AM,’ and can be prevented by ensuring that the 
compensating term 7,(t) cannot fall beyond a value r,. A block diagram 
of such a system is shown in Fig. 3. The demodulated signal is now 


f(t) = Ax(t)m(t)exp{j[ o(t) — op(t)]}, (32) 
where 
— if m=nr, 
Ax(t) = 2 : =r min( = *) (33) 
x if r<Pro see 


Parameter r, can be selected to maximize the system performance. By 
imposing this limit on rp, the signal (ji(t))4, can be shown to be’ 
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(ii(t))a, = (Axr(t))m(t)exp{j[o(t) — dp(t)]}, (34) 


where 


(Ax(t)) = [ “dr | ' aaa | ~T| 
I (Fx) 

+f ar | are) war oy | wa won| 
fi fm 2) 


To De ie 
= 80) - Veal (2-1) saa 
WS re ? Aro 
[5 Gea 7) 7 ( - 2) con = =) | 
ye foo) 
Eb, (oe 
a= r( 2k +1, (5 : 1) {yea - it) 
= oe. See ee 
rz -1) 
3 ( 
93/2(] — xD | 2b aay (5 -1)(03/ {20°(1 — xt 
a — o-oo. 
@) 


In this equation, J,,(-) is the modified Bessel function of the first kind 
and of order n, E'(A) is the complete elliptic integral of the second kind 


(35) 





given by 
(Qn —1)!)" 2 
eoagf- 3 (amy) a 
(2n — 1)!!.413.5--- (Qn —1), (37) 


and I'(p, x) is the incomplete gamma function,’ 
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Fig. 2—Spectral arrangement of ssB signal with pilot tone. 
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Pape 5 


nao N'(p + n)- (38) 


Knowing the average output signal (ji(t))4, we determine the vari- 
ance of the noise n(t), namely, 


(n?(t)) = {(AZ) — (Az)?}| m(é) [?, (39) 
where (A_) is given in eq. (35) and 


r+rz, 
(Ai) = fe ar | Ore A) a a exp| - | 
Ad (“7 u a) 
7 e rrp r+ rs 
: i sd | Ue — exp | 07 —)) 


rp A 
(3 1- z) 
1 = re 
exp| —7 3 


r2 
af = N2 ee Pe 40 
(1 r ve.( $5) Te: r2/ (20°) ( ) 
In eq. (40), E1(x) is the exponential integral”’ 
Ex(x) -| oe at x>0. (41) 
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Fig. 3—Pilot correction scheme. 


From eqs. (32), (34), (35), and (39), the s/n is given by 
7 (Az)? 
(Az) — (Az)*’ 


where (A,) and (AZ) are given by eqs. (35) and (40), respectively. 

The s/n of eq. (42) is plotted in Fig. 4 as a function of (r2/20°), where 
ro is the smallest value of r,(¢) and 20” is the variance of r(t) [or of 
r,(t)| for various correlation amplitudes X. This figure is similar to Fig. 
4.1-23 of Ref. 1. When 7, is too small, the speech signal may occasionally 
be over-corrected, resulting in poor s/n. If r, is too large, the signal 
compensation may be insufficient, even for high values of A, a condition 
of false pilot fade. Thus, for a given o” and A, there is a broad optimum 
for ro, the limit of permissible gain control. The dotted curve in Fig. 4 
is the locus of r5/2o” for optimum gain limiting to ensure peak s/n for 
a given A. 

The figure also shows that A needs to be large if the noise is to be 
low and high s/n achieved. For example, with optimum gain limiting 
and A = 0.999, the output s/n is approximately 20 dB. For a time-delay 
spread Tq of 1 ys and A = 0.999, the maximum frequency separation of 
the pilot from any part of the ssB signal must be less than 


Jin 
fa = 2rT ar 
a value that is attainable in ssB speech systems. Thus, an s/n of better 
than 20 dB can be achieved with the transmission of a pilot carrier 
and fast-acting automated gain control. 


s/n (42) 


= 7.12 kHz, (43) 


3.3 Effects of Rayleigh fading on a single frequency-hopped SSB signal 
transmitted with a pilot carrier 


After considering the effects of Rayleigh fading on a ssB signal and 
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ae 4—Signal-to-noise ratio as a function of r3/2o” for various frequency correlation 
values. 


how its reception is improved with the aid of a pilot carrier, we now 
examine the effect of frequency-hopping the ssB, plus pilot tone signal 
in the presence of Rayleigh fading. The frequency-hopping strategy 
has already been outlined in Section II. 

The recovered speech signal in frequency-hopped ssB with pilot tone 
(FH-SSB-PT) is found by proceeding as in Section 3.2, but modifying eq. 
(26) to allow for the time segmentation, namely, 





p(t) = y juoreer(’ — *), (44) 


l=-—o 


where ji:(t) from eqs. (32) and (33) is 
jin(t) = n(@min| = pexwCito ~ bolt) mt). (45) 
Tpi(t) To 


The subscript / refers to the /th time slot, and ¢:(t) and ¢,:(t) are the 
phase angles of the complex fading signals 7:(¢)exp[j¢:(t)] and 
rpt(t)exp[ Jopi(t) ] associated with the signal and pilot tone, respectively. 
Quantity m.(t) is the segment of the complex envelope of the original 
speech in the /th time slot. Equation (44) can be expressed as 
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p(t) = = Rilt)mi(t)exp{j[ g(t) — dpi(t) I}, (46) 


where F,(t) is the composite fading envelope because of the mobile 
radio environment. Notice that R,(t), given by 


Ri(t) = n(min| 2 hreei(4 — “) (47) 





Ypi(t) To 


has similar characteristics to those of Az(t) [see eq. (33)], except that 
R(t) applies for the 7th slot of t seconds duration. 
Before proceeding, we make the following assumptions. 

(1) Values of R,(t) for different /, i.e., for different time slots of 
duration 7, are statistically independent. This implies that the scram- 
bling algorithm controlling the frequency hopping ensures that the 
carrier frequencies in two contiguous time slots are separated by a 
frequency band that is larger than the coherence bandwidth of the 
mobile radio channel. 

(iz) Over a long period of time, each of the N channel frequency 
bands is likely to be occupied by each of the N voice signals. 

(uit) The fading characteristics of all the radio channels are identi- 
cally distributed. 

If we consider any two angular frequencies w, and w:, where 
w2 — a, exceeds the coherence bandwidth, their fading patterns will in 
general be very different. Figure 5 shows a typical example. Even if 


W4 
CARRIER FREQUENCY f, = oa 


rg it) 





w 
a 
3 
aa) 
Ww Ws 
2 CARRIER FREQUENCY f2 = 5— 
du 
ry(t) 
Fig. 5—Fading patterns for angular frequencies w, and we; w; — we exceeds the 


coherence bandwidth. 
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two mobiles use the same carrier frequency their fading characteristics 
will generally be different if the vehicles are separated by several 
wavelengths of the carrier frequency. In our analysis, we assume all 
fading characteristics are statistically independent. 

The recovered signal is found by averaging fi(t) of eq. (46) with 
respect to the set {R;} = {---, Ro, Ri, «++, Ri, +++} 


(B(é)) ca) = = (Ri(t) mi(t) 


=—00 


= (R(t) » mit), 


mi(t) = mi(t)exp{j[ gilt) — dpi(t)]}, (48) 
where (,(t)) is assumed to be independent of the time slot /, and is 


represented by (R(¢)). 
The noise n(t) becomes for the frequency-hopped system 


n(t) = 2 [Ri(é) — (Rilt)) |milt) , (49) 
and its variance 
(n3(t)) = {(R7(t)) — (R(t))7} yy | mu(t) |? (50) 


as R(t) for —0% < 1< © are assumed to be independent. 
The s/n is, from eqs. (48) and (50), 
P. 
t 


. 
(R(t)) | . (51) 


SS 
(R(t) a (R(t)) <2 | m(t) ? > 


<( )>- (sO), 


is the average power in the original speech signal, and 


= | maiz (t) » 


is the sum of the average powers in the segments of the speech signal. 
For Gaussian signals, it has been shown” that 


y milf) 


l=—a 








where 


3 mle) 


I=—00 
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y mit) 


l=—0oo 





(| E.mel) > 


 ImoP > 


Assuming that speech signals have approximately Gaussian character- 
istics, this result is also true for speech. Hence, 


(R(t))? 
(R(t)) — (R(t))” 





= 1. (52) 


s/n = (53) 
Since this s/n is the same as eq. (42), frequency hopping does not 

lead to any degradation in performance of the system. Note further 

that Fig. 4 can be used to determine the s/n of an FH-SSB-PT system. 


IV. EFFECTS OF CO-CHANNEL INTERFERENCE ON A FH-SSB-PT 
SIGNAL 


Consider a mobile radio system which has been allocated an RF 
bandwidth of Br Hz, where each cell is assigned a bandwidth of B Hz. 
Cells B7/B can use different parts of the Br spectrum; but if more 
than Br/B cells are required to accommodate user demand, some 
frequency bands must be reused. Let us consider two such cells which 
are spaced apart as far as possible to restrict mutual interference. 
Vehicles in each cell communicate via the cell site station using FH- 
SSB-PT, and let us concentrate on a vehicle in one cell which is 
subjected to interference, called co-channel interference, from mobiles 
in the co-channel cell. By arranging for each cell to frequency-hop 
using different scrambling codes, the co-channel interference will be 
incoherent. Observe that FH-SSB-PT operates on the interfering signal 
with the incorrect descrambling code and adds to the wanted speech 
signal short segments of numerous speech signals in the co-channel 
cell. The co-channel signal is, therefore, more noise-like and percep- 
tually more acceptable than a single coherent interfering speech signal, 
provided 7+ is suitably small. This is the main advantage of using 
frequency hopping. 

An analysis to determine the subjective nature of interference is 
very difficult and, therefore, we characterize the interference in terms 
of its mean square value. We proceed as follows. 

Since the voice signal associated with a particular frequency slot of 
width B, is different from one time slot to the next, we will evaluate 
the effect of co-channel interference in a single time slot of duration 
7. The total input to the receiver consists of the sum of the wanted 
and interfering components and is given by 
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ar = r(t)m(t)exp[ 7 (wet + b)] + rp(t)exp[ J (wpt + dp) ] 
+ S{ri{t)mi(t)exp[7 (wit + $i)] + rpilt)explJ (wpit + opi)]}, (54) 


where the subscript i refers to the interfering or co-channel signal, 
subscript p refers to the pilot tone, w, and w; are the carrier frequencies 
of the wanted and unwanted signals, and the remaining symbols have 
their usual meaning. Carrier frequencies, w, and w;, are equal as eq. 
(54) applies for one time slot. The relative amplitude 6 of the interfer- 
ence determined by antenna and propagation characteristics, and in a 
nonfading environment the signal-to-interference ratio at RF is 1/68”. 

Suppose that the pilots associated with the wanted and co-channel 
signals are off-set (say, one above and one below the ssB signal 
frequency band) with the result that the desired pilot is unaffected by 
the unwanted pilot. The demodulated speech signal is, therefore, 


pe ese =a 


p(t) r aCe) Mi esPLI ($i — Go)]- (65) 


Hence, 


a > midchexpl (di ~ 9}, (56) 
where ji(¢) is given by eq. (26). Averaging with respect to r(é) and r;(¢), 
we compute the average signal (8(t)). The noise signal is B(t) — 
{B(t)). Hence, 


B(t) = p(t) +6 


A 2 
a a ea 
where 
(A) = (Ar) (58) 
(A?) = (Ai) + 67(A?) (59) 
and 


‘i 
(A?) = C{ nin, “)| . : (60) 
Tp To ror 


Pp 


Note that (A,) and (AZ) are given by eqs. (35) and (40), respectively. 
Assuming r; and r, are statistically independent and both are Ray- 
leigh-distributed with the probability density function of eq. (13), 


2 
(A?) = (r3 5 € {min( 2 x) F (61) 
Fo To . 


(r?),, = 20° (62) 


Further, 
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and 


2 r, 
1 1 1 ae is 2 
Tp To . To 0 0 
Pp 


= 1] r 2/942 
+| =:s4e7" dr 
7 ro 


oO 


L—e rey re 
2 ha ’ (63) 


rs 20 207 
where £,(-) is given by eq. (41). Hence, 


(Ax)? 


= TAR) + OA) = (Any ee) 


s/n 


where (A?) is given in eqs. (61) to (63). Notice that if 5 = 0, i.e., no co- 
channel interference, eqs. (64) and (42) are identical. For interference 
from M co-channel cells, the s/n becomes 


(AL)? 
(AL) + 3) 83(A3) — (Ax)? 


1 


s/n = (65) 


where 6; is the relative amplitude, and (A7;) is the appropriate value 
of (A?) given by eqs. (61) to (63). 


V. FH-SSB-PT WITH SPACE DIVERSITY RECEPTION 


The impairments associated with FH-SSB-PT in a mobile radio envi- 
ronment can be alleviated by using diversity techniques. As we would 
like to maximize the number of users in a given RF band, the diversity 
methods most appropriate are space-diversity techniques. 

The simulation analysis provided in Ref. 5 of an equal gain, N 
branch diversity receiver for conventional SSB-PT is applicable to the 
FH-SSB-PT system presented here. In the receiver the individual branch 
pilot signals are phase-corrected and then averaged to provide the 
envelope correction signal. Amplitude correction is then applied as in 
the nondiversity case described in Section 3.2. 

The improvements in s/n obtained through space-diversity recep- 
tion are substantial. For example, a two-branch space-diversity re- 
ceiver with 20 dB (=10 log r2/20”) of correction gives 18-dB improve- 
ment in s/n when the delay spread Tz is as high as 3 ws. A four-branch 
diversity receiver provides another 13-dB gain in s/n. Those improve- 
ments are obtained in FH-SSB+PT when space-diversity techniques are 
applied. 
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Vi. COMPARISON OF FH-SSB-PT WITH FM 


In this section, we shall present a comparison of user densities 
obtained with FH-SSB-PT and wide-index FM. Assuming that the use of 
space diversity mitigates the effects of fading, we consider only deg- 
radation from co-channel interference. Consider a mobile radio system 
based on a cellular structure,’® where each cell is hexagonal in shape. 
Figure 6 shows two such cells; cell A is the center of a cluster of U cells 
that have been allocated an RF bandwidth of Br Hz and each cell has 
a bandwidth of B Hz. The value of U is 


U=?+i+ 7", (66) 


where 7 and j are marked on Fig. 6. There are co-channel cells arranged 
symmetrically around cell A that will cause interference with mobiles 
travelling inside this cell. One of these co-channel cells is cell B, spaced 
a distance D from cell A such that 


D 

—= v3U, (67) 
R 

where R is the cell radius. 


Suppose a mobile in cell A is at the cell boundary on a line connecting 
the center of cell A to the center of cell B, and that the cell site 
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Fig. 6—Co-channel cells for hexagonal cell structure.® 
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antennas are at the cell centers. The ratio 5” [see eq. (54)] of the 
received radiated powers by the mobile from the antennas of cells A 


and B is approximated by 
R \" 
§? = —_——$______ 
€ - 5) ' 68) 


where n varies between 2 and 4. We assume n = 3.5. However, there 
are six co-channel cells” at a distance D and other cells spaced kD from 
A, where k = 2, 3, --- . We consider only the closest six cells, as they 
cause the dominant co-channel interference, and assuming each co- 
channel cell radiates the same power and D > R, 
RF co-channel interference power 
Se es CGI) Rs 1): S69) 
RF signal power 
Following demodulation, the s/n values for the SSB and FM systems 
are 


s/n; = (70) 


637 


_fl a x° 
s/n (az) Vieq 


where the subscripts 1 and 2 refer to the ssB and FM systems, respec- 
tively, and x is the FM modulation index. 
Let the comparison be made on the basis that 


s/n = s/ne, (72) 


ot _ NE 0.46 
van s x : (73) 
Equations (67), (68), and (73) yield 


(2 U2 — ‘) _ 0.45 fais 
V¥3U; - 1 iad 


The rF bandwidths allocated to the entire system, to each cell and 
to each voice channel are Br, B, and B., respectively. The number of 
available channels is B7/B., and the number of users in each cell is 


and! 


which results in 








Ne ee (75) 
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If the cell sizes are the same in the ssB and FM systems, the user 
density p, i.e., the number of users per unit area, is 


1 Br 


~ 7 DPB oe 


p 


where D is the co-channel distance [see eq. (67) ]; i.e., we approximate 
the cluster of non-cochannel cells by a circle of radius D. The ratio of 
user densities in the SsB and FM systems is, therefore, 


2 
Pl = De Bee 
=~ (55) (32) ™ 


As an example, let the number of cells in the FM system be 12, and 
we will assume x = 2.5. From eq. (74), the number of cells in the ssB 
scheme becomes 73. For Br = 20 MHz, B.1 = 4 kHz, B.2 = 30'kHz, 
D, = R,V3U; = 14.8Ri, D2 = R2V3U2 = 6R2, and as R; = Ro, D2/D, 
= 0.405. Hence from eq. (76), p:1/p2 = 1.23, i.e., the user density of SSB 
is 1.23 times higher than that for Fm. From eqs. (69) and (70), the s/n 
is 





1 
s/n = (V3U — 1)*° = 32 dB, 
a value that is more than adequate. 


Vil. DISCUSSION 


Users of a mobile radio system want good service, being completely 
indifferent to the type of modulation employed. The designer of a 
mobile radio system, however, is faced with a hostile communications 
environment. If digital modulation methods are used the speech signals 
have to be encoded into digital waveforms that have significantly more 
bandwidth (typically a factor of 6) than that of the speech signal. 
Unfortunately, narrow-band FM with its small signal expansion is 
difficult to use for mobile radio because of its vulnerability to co- 
channel interference, while wideband FM with its acceptable perform- 
ance requires a bandwidth that is comparable to the bandwidth of 
digitized speech. However, the bandwidth of the ssB signal is the same 
as the original speech, and this has encouraged us to innovate new 
techniques to overcome inability of SsB to perform well in the presence 
of co-channel interference and fading. We were aware at the inception 
of this study that many people have analog radio receivers and, 
therefore, the capability to eavesdrop on mobile radio conversations 
using analog modulation methods. The outcome of our deliberations 
is the frequency-hopped ssB with pilot tone (FH-SSB-PT) scheme de- 
scribed in the previous sections. 


1408 THE BELL SYSTEM TECHNICAL JOURNAL, SEPTEMBER 1982 


Although the concept of SsB and its compensation by a pilot tone is 
well known, we presented in Sections 3.1 and 3.2 new analytical results. 
For the case of pilot tone compensation of the ssB signal in the 
presence of Rayleigh fading, we used an alternative definition of s/n 
from that used in Ref. 9, and further, we expressed it as a function of 
A instead of limiting it to the special case of A = 1. Our results show 
that pilot compensation can achieve an s/n of more than 20 dB if 
A > 0.999 and fa = 7.1 kHz. A more relevant value of fa for speech is 
probably 3 kHz, and Tz is typically 1 ys, then from eq. (29) A = 0.9998, 
resulting in s/n = 27 dB. 

When the sspB signal, plus pilot tone is frequency-hopped, we show 
that no degradation in s/n occurs. Thus, although frequency hopping 
does not alter the s/n [compare eqs. (42) and (53)], it does have 
important perceptual advantages. For example, consider a stationary 
mobile (e.g. parked, or in congested traffic) operating in an SSB system 
that does not employ frequency hopping. Suppose the mobile is sub- 
jected to a deep fade that defies pilot correction, resulting in a complete 
disruption of communications. Now imagine that frequency hopping is 
applied; the probability that communication will be restored increases 
as the probability of repeatedly hopping into a deep fade condition is 
small. Of course, if the mobile were not in a deep fade but receiving 
excellent communications, the effect of frequency hopping would de- 
grade the recovered speech, although the degradation would probably 
be marginal. What frequency hopping ensures is that no mobile is ever 
in a permanent deep fade, provided the vehicle is not in a completely 
hostile environment, such as a tunnel, etc. 

Generally, pilot correction can be performed, and the recovered 
speech will have only occasional segments of duration 7 that have been 
obliterated by the fade. As they are likely to be in juxtaposition with 
good quality segments, speech enhancement procedures can be used 
on the degraded segments, particularly if 7 is of the order of a pitch 
period. If 7 is small (<5 ms), the model of Rayleigh fading is not 
applicable and the accuracy of our s/n analysis is unknown. Note that 
the short-time statistics of fading are also unknown. The choice of 7, 
therefore, depends on many factors, ease of switching, aid to enhance- 
ment, etc. Duration 7 also determines how noise-like the co-channel 
interference will be. From these considerations, a suitable choice of 
is between 10 to 50 ms. 

We have derived and discussed in detail the problem of frequency 
selective fading. By employing space diversity reception (see Section 
V) the signal deterioration caused by fading can be further reduced. 
This means that the factor controlling user density is co-channel 
interference, and it is here that frequency hopping makes its most 
significant contribution by changing the nature of co-channel interfer- 
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ence from cross-talk to almost band-limited white noise. Comparing 
eqs. (64) and (53), we see that the effect of co-channel noise is to 
introduce the term 6?(A?), where (A?) is noise-like, composed of 
segments of r seconds, where adjacent segments contain speech from 
different speakers in the interfering cell. By frequency hopping we 
have circumvented, in perceptual terms, the inability of the ssB re- 
ceiver to capture the wanted signal. What we are left with is the speech 
signal corrupted by noise, and we envisage that the next phase of the 
research will involve the development of techniques to strip the noise 
from the speech signal. By doing this, we will be able to operate with 
high s/n, or alternatively move the cells closer together, thereby raising 
the user density. 

Frequency hopping is not confined to sss. When applied to FM, eq. 
(53) applies with the same advantages as for sss. In terms of co- 
channel interference, there would be no need to rely on the capture 
effect of FM, and further, a measure of privacy could be obtained. 

Summarizing: SSB can combat fading if pilot tones are used, and has 
enhanced quality if space diversity reception is added. By using fre- 
quency hopping under the auspices of a scrambling code, improve- 
ments occur for stationary and slow-moving vehicles. The most signif- 
icant advantages accrue from combating co-channel interference, 
where crosstalk is perceived as random noise. Scrambling codes to 
satisfy the requirements of the system are trivial to generate and have 
not been discussed here. When compared to FM, the user density of 
FH-SSB-PT is marginally higher. Frequency hopping is not confined to 
SsB, can be applied to FM with advantage, and offers privacy in mobile 
radio where eavesdropping may be a problem. 

Finally, we comment that since ssB demodulation requires recovery 
of a carrier, linearity of its modulators, etc., it is still possible that ssB 
user density may not be higher than that of wide-index FM in all 
situations. Also, since the availability and economics of SSB are not 
discussed in this paper, we do not imply that current FM modulation 
used in the AMPS system is not technologically appropriate at this 
time. We present a new analysis of ssB and point out some directions 
of research to resolve these questions. 
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Description of Fasnet—A Unidirectional Local- 
Area Communications Network 
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Fasnet is an implicit token-passing, local-area network aimed at 
supporting high data rates and carrying a wide mix of traffic (data, 
voice, video, and facsimile). Transmission is unidirectional with 
stations attaching to the medium passively via directional couplers. 
A link consists of two lines, one to carry traffic in each direction. 
Unidirectional transmission provides the potential for efficient op- 
eration at high data rates, while the passive medium provides the 
potential for high reliability. We describe the physical configuration 
and the protocol and give channel utilization for the condition of 
continuously queued sources. Mechanisms to control the access of 
various traffic types are described. Finally, the interconnection of 
multiple Fasnets is studied for one particular configuration, a ring. 


I. INTRODUCTION 


Local computer networks operating at 1 to 10 Mb/s are being 
commercially offered and appear to adequately meet current demands 
for computer communications within the office environment. However, 
future needs stimulated by both a broader range of services than is 
now available and changes in system architecture (e.g., the trend 
towards distributed processing) could increase significantly the de- 
mand for digital capacity. For example, one would like to be able to 
handle video information, voice traffic, and facsimile, as well as com- 
puter traffic, in a single digital system. The availability of a cheap, 
high-capacity communication conduit between computers will itself 
stimulate increased traffic. For example, processing time can be traded 
for communication capacity; rather than transmitting a text file and 
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formatting it at a remote location, one may choose to transmit a 
formatted version or even a bit map. Thus, while today 10 Mb/s may 
be regarded as an extremely generous bit rate for a local computer 
network, 200 Mb/s may become limiting for an integrated communi- 
cations network. 

Carrier-sense multiple access with collision detection (CSMA/CD) is 
reliable and reasonably efficient even under heavy load for most 
conditions.! Shoch and Hupp’ show that measurements of channel 
utilization of an Ethernet* yield results that are close to calculations 
made by Metcalf and Boggs, using a simplified model and assuming 
that active stations have data continuously queued for transmission. 
Utilization is reported in Ref. 1: y = y/[y + F(M)], where y is the ratio 
of packet duration to slot time and F’'(M) is a slowly varying function 
of M, the number of active stations. Utilization is plotted as a function 
of y in Fig. 1, with M as a parameter. If we assume that slot time is 50 
ps,’ and the transmission rate is 100 Mb/s, then for an average packet 
length of 1000 bits, y = 0.2 and utilization is in the range 7 to 8 
percent.t Shorter packets, higher transmission rates, or longer slot 
times would further decrease efficiency. Also note that the above 
equation does not incorporate source acquisition and synchronization 
time which, like slot time, is relatively more significant at higher 
transmission speeds. Thus, it appears that cSMA/CD is not viable for 
operation at high data rates.” 

Fasnet is an implicit token-passing protocol developed to efficiently 
utilize the channel capacity when the ratio of packet duration to the 
maximum station-to-station propagation time is small (<1). Informa- 
tion flows in only one direction on the medium, unlike the usual csMA/ 
cD configurations (although see Refs. 5 and 6), but like csmA/cp the 
essential passivity of the medium is retained. The access method is 
closely related to a ring protocol (e.g., see Ref. 7) and may be regarded 
as a variant of implicit token passing. 

Reliability was an important consideration in the design of Fasnet. 
Consider both the transmission medium and the control electronics. 
Reliability of the transmission medium may be enhanced by keeping 
active electronics in the medium to a minimum. Bus architectures 
such as Ethernet have occasional repeaters, depending on the length 
of the signal path. Cable-Tv (CATV) type architectures have periodic 
line amplifiers whose spacing is determined by the number of stations 
(taps), as well as by cable attenuation. Ring architectures usually have 


* Ethernet is a trademark of Xerox Corporation. 

+ Since Ethernet requires y = 1.0 for collisions to be reliably detected, utilization for 
values of y < 1.0 were obtained by multiplying the utilization for y = 1 by the value of 
y. This can result in efficiencies much lower than that obtainable by other csMA 
protocols, e.g., p-persistent csMA.* 
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Fig. 1—Plot of efficiency as a function of the ratio of packet duration to slot time. 
Parameter is the number of simultaneous users. Efficiency changes little as the number 
of users goes from 5 to ~, 


most electronics in the signal path; digital regeneration is usually 
provided at each station.’ Turning to control, reliability considerations 
tend to favor distributed control. An alternative to a fully distributed 
system is to permit some stations to perform unique functions but 
have these functions assumable by any station on the network; how- 
ever, this can result in a large cost penalty. A further alternative is to 
have the function performed on a server basis (two or more stations 
provide the service to all other stations). The Fasnet medium resem- 
bles that employed in CATV, and control is primarily distributed with 
some functions assumable by all stations. 

There is considerable attraction in having a single system to handle 
all forms of traffic in the environment. Indeed, such a system may be 
regarded as an extension into the local environment of the Integrated 
Systems Digital Network (ISDN) that is being so vigorously pursued in 
the long distance and local loop environments by the common carriers.° 
An integrated transmission system simplifies the implementation of 
services that utilize different types of traffic. Examples are voice- 
annotated electronic mail® and interactive use of voice and facsimile."° 
An integrated transmission system also provides the opportunity to 
reduce overall transmission needs by taking advantage of the comple- 
mentary nature of some types of traffic; for example, most electronic 
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mail can be deferred until after the voice busy hour. Further, one 
would anticipate cost reductions in having one integrated system over 
a number of separate systems. 

While design of an integrated system offers opportunities, it also 
presents the need for compromises and trade-offs. Consider terminal 
costs and transmission efficiency. If a system is to provide economical 
interconnection for telephones and terminals, it must permit construc- 
tion of an interface that is cost-effective relative to alternative solu- 
tions. This may mean that some interfaces have to be tailored to the 
specific application to make them competitive. 

Virtually any type of traffic should be able to exploit the channel 
efficiently. For example, environments that generate a large number 
of short messages (e.g., computer terminal traffic), as well as environ- 
ments that generate a preponderance of long messages (e.g., file 
transfers), should be able to operate efficiently. This requires that 
there be a minimum of structure at the lowest common level of the 
service. For example, a packet structure which mandated a source- 
address field, while useful for computer traffic, may be unnecessary 
overhead for a voice channel where call set-up would establish the 
identity of the source. 

The description of Fasnet starts in Section II with the physical loop; 
the access protocol is described in Section III. The performance of the 
basic system is given in Section IV, followed by a discussion of some 
of the system design issues (in particular, the synchronization and 
signaling procedures) in Section V. Section VI describes variations of 
the basic system, including methods for improving efficiency, particu- 
larly when the number of users is small. Section VII describes mech- 
anisms to support the efficient management and control of mixtures of 
different traffic types. Section VIII describes the interconnection of 
Fasnets, with consideration of the impact of the topology on through- 
put and ability to handle localized sources of traffic. Section IX 
summarizes the paper. 


ll. PHYSICAL CONFIGURATION 


The basic link, as shown in Fig. 2, consists of two lines. One line 
passes all stations carrying traffic in one direction and the other line 
passes all stations carrying traffic in the other direction. For line A, 
station S, is referred to as the head station and S, the end station. For 
line B the assignment is reversed. Together the two lines provide a 
connection between any pair of stations attached to the link. While 
the lines may be either twisted pair, coaxial cable, or light fibers, we 
will be primarily concerned with a coaxial cable implementation. Each 
station makes two connections to each line. A read tap precedes a 
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Fig. 2—Physical configuration of a Fasnet link. 


passive directional coupler used for writing. The nature of the direc- 
tional coupler is such that very little energy travels in the reverse 
direction on the line so that the signal read virtually simultaneously 
from the read tap will be unaffected by the signal being written on the 
line via the directional coupler. A station writes on the line by adding 
energy to the signal already existing on the line. Except for specific 
fields of the header, the protocol ensures that only one station at a 
time writes on the line. Thus, once a signal is written on a line, it is not 
removed or changed by any station. This has certain implications for 
the line code that is selected (Section 5.2). 

Depending on the length of the line, amplifiers are needed to boost 
and compensate the signal. The technology and design procedures 
used for CATV systems’’ are directly applicable here, although the noise 
margin required for a high-quality video signal is somewhat greater 
than that required for two- or three-level digital transmission. 

Links may be joined together to form a network of links. Usually, 
links will be run in pairs of lines, but this is not always necessary. The 
advantage of using multiple links is that the traffic-carrying capacity 
of the network can be increased and reliability may be improved by 
the use of redundant paths. 

An earlier version of Fasnet’ differs primarily from the system 
described here in that a link consists of a single unidirectional line that 
passes each station twice—on the outbound or write side and on the 
inbound or read side. Each station makes three connections to the line, 
a read tap for control purposes, and a directional write tap on the write 
side, and a read tap for recovering data on the read side. The primary 
advantage of the scheme described here is that the link can carry 
approximately twice the traffic of the earlier version. A disadvantage 
is that a station must select the correct line on which to transmit, and 
this will depend on the relative physical location of the destination. 
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lil, PROTOCOL DESCRIPTION 


The data link layer may be divided into two sublayers. One 
sublayer, the logical link control with which we are less concerned 
here, provides functions like addressing, windowing, and acknowledg- 
ments. The other sublayer, the media access control with which we 
are more concerned, determines when and how to send information 
via the physical medium. This is influenced by the media type, the 
physical configuration, and the technology used. 


3.1 Frame format 


The frame structure suggested in Ref. 13 and its relation to the data 
link sublayers is shown in Table I. The information unit is delivered 
by the network layer. The logical link control appends the source 
address, the destination address, the link control field for windowing, 
acknowledgments, and similar functions. We call this unit a packet, 
and in the work described here we will assume it is of fixed length. 
The media access control sublayer appends (i) the frame check se- 
quence computed on the previous fields for error detection and (vz) 
the access control (Ac) field which determines how and when each 
station may access the physical medium. The main objective in the 
design of this field is to control access among all active stations in an 
efficient, reliable, and fair manner. The frame start and frame end 
delimiters are unnecessary, since the stations are kept in tight bit and 
frame synchronization (see Section 5.1). The duration of the frame is 
referred to as a slot. 


3.2 Access control 


Basic access control for Fasnet is as follows. The head station, S:, 
initiates a cycle on line A. After a cycle has been initiated, each active 
station on the line with packets destined in the right direction is 
allowed to access the line for one slot. To do this, each station monitors 
the line. When it senses the line idle, it seizes the line for one slot. It 
has to wait for a new cycle to be initiated before it attempts to access 
the line again. The exact manner in which this is done efficiently and 


Table |—Protocol and frame structures 


Protocol Structure Frame Structure 

Data link Logical link control sublayer DA/SA/LC/IU 

layer Media access control sublayer FS/AC/DA/SA/LC/IU/FCS/FE 
Physical layer Physical layer signaling FS/AC/DA/SA/LC/IU/FCS/FE 
FS: Frame starting delimiter LC: Link control field 
AC: Access control field IU: Information unit from network layer 
DA: Destination address FCS: Frame check sequence 
SA: Source address FE: Frame ending delimiter 
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fairly is described in the next paragraphs. If a station has priority, it is 
given permission to access the line for an integral number of slots. In 
this manner, the active stations can access the line for a specified 
duration in the order in which they are physically located on the line. 
The operation on line B is identical to that described above with Sy 
replacing S, as head station. 

To describe the operation in more detail, let {S,, So, --- , Sw} be the 
set of stations in the order of their physical locations as shown in Fig. 
2. Let AQ; and BQ; be the number of packets queued at station S; for 
access to lines A and B, respectively. 

When the next packet arrives at S; from the Network layer interface, 

if destination address j > 1, then AQ; is incremented by 1; 

if destination address 7 < 1, then BQ; is incremented by 1. 

The structure of the Ac field is shown in Fig. 3. Let tj, be the start 
of the nth frame. The Ac field is from ty, to t,,. Station S; gains access 
to line A in the following manner. Let S; be permitted access for Dmax 
packets each cycle. At ¢;,, the start of the nth frame, the read tap reads 
the START bit of the ac field. The start of cycle is indicated by 
START = 1. Because of gate delays in the decision circuitry and 
propagation delays in the tap cables, the outcome of the read operation 
is only known at ¢,,. This additional time of duration Tdec, Shown in 
Fig. 3, is of the order of a few bit times for a 100 Mb/s line and 
nanosecond logic. Next, the station may simultaneously read the 
BUSY bit via the read tap and write BUSY = 1 via the directional 
coupler. Again, the outcome of the read operation is only known at fs, 
after a delay of taec. Nonetheless, the write operation does not alter 
the BUSY bit if it is already set to 1. The nature of the signaling to 
achieve this is explained in Section 5.2. At t,, if BUSY = 1, the station 
defers until the BUSY bit of the next frame. If BUSY = 0, the station 
accesses the line for the remaining frame duration. 

Station S; is said to be in one of four states: 

IDLE —if it has no packets to transmit, 1.e., AQ; = 0. 

WAIT—if it is waiting for the start of cycle. 


—ACCESS CONTROL FIELD - 
——PACKET —— > 


START END BUSY START END BUSY 
WW Yj Y WY bit bit 


Fig. 3—The frame structure of Fasnet. Each frame consists of (¢) an access control 
field containing START, END, and BUSY bits; and (ii) the packet as provided by the 
logical link sublayer. 
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DEFER —if it is deferring to busy users who are upstream on the 

line. 

ACCESS—if it is accessing the line. 

The station makes transitions (denoted as —) between states as 
follows (Fig. 4): While AQ; = 0, S; is in IDLE. Upon arrival of a packet 
for line A, AQ; > 0 and S; > WAIT. The station reads the START bit 
of every frame. When START = 1 S; — DEFER, and the station 
simultaneously reads and writes the BUSY bit as described above for 
every frame. When BUSY = 0 S; > ACCESS. Now it accesses the line 
for Pmax frames and also writes BUSY = 1 for each. Then S; > WAIT. 
The station may cease to access the line earlier if AQ; = 0, whereby S; 
— IDLE. 

Station S, initiates cycles by START = 1 in the first frame of each 
cycle. There is an additional bit, END, in each frame to indicate the 
end of cycles. This bit can be conveniently located in the blank portion 
of the frame after the START or BUSY bits. When station Sy reads 
BUSY = 0 on line A (indicating that all active stations have accessed 
the line), it sets END = 1 in the next frame on line B. On receipt of 
this frame on line B, S; then initiates a new cycle on line A. Thus, in 
the worst case, line A will be silent once every cycle for a time equal 
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Fig. 4—State transition diagram describing the operation of a Fasnet station. 
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to twice the end-end propagation delay, plus twice the frame duration, 
as each end station has to wait until the next frame to set the START 
or END bits. 

The operation on line B is identical, with the roles of S, and Sy: 
reversed. Thus, the two lines cycle independently of each other with 
access being passed between the stations in the same order as their 
physical locations on each line. 

In the protocol described above, the outcome of the read operation 
on the START bit needs to be known before the BUSY bit is written 
so that the first frame of a new cycle does not remain idle. Should the 
START and BUSY bits be adjacent to each other, a station will only 
learn that START = 1 after the BUSY bit has passed and the frame 
will not be used. However, for large cycle lengths and short packet 
lengths, the reduction of one decision interval, taec, per frame would 
be greater than the addition of the extra idle frame. 

A further alternative is to have the first frame of each cycle contain 
only an access field. However, unequal frame sizes complicate syn- 
chronization for a very small increase in efficiency. 


3.3 Error recovery 


The protocol is controlled by the START, BUSY, and END fields. 
An error in a BUSY field will have no lasting effect; it will result in a 
packet being overwritten if the busy bit is changed from a 1 to a 0. 
Alternatively, an empty slot will go unused if the busy bit is changed 
from 0 to 1. Of more significance is an error in the START and END 
fields. If a START field is set to 1 in error, two STARTs or aSTART 
and an END would be simultaneously present on the loop. 

It will be shown that generation of additional STARTs and ENDs 
will not propagate and have little effect on the operation of the link. 
We will assume that end stations do not generate START's or ENDs 
that are closer together than the round-trip delay time, 7,; under 
normal operation this cannot occur. A false START = 1 will occur 
either in the active portion of a cycle (including the first empty slot) 
or in the empty slots occurring at the end of the cycle. If the former, 
a new cycle will start before, or as, the old one is finishing. Since the 
additional START = 1 will not generate an END = 1 on the return 
line (because no transition from busy slot to empty slot is detected), 
the condition will not propagate. If the false START = 1 occurs in the 
empty slots, other than the first, the new cycle will start prematurely 
(actually increasing utilization temporarily). One of two conditions 
results: 

(i) The busy part of the additional cycle terminates at least one 
slot before the next normally occurring START = 1, in which case the 
end station will detect an end condition. However, because the period 
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since the last END = 1 is less than 7, a new END = 1 will not be 
generated. 

(ti) There is no empty slot before the next normally occurring 

START = 1. As a result, an additional end condition is not detected 
by the end station. 
Thus, a START = 1 resulting from a fault condition will not produce 
additional END = 1 bits on the return line. On the other hand, END 
= 1 faults, unless they are closer together than 7,, will produce 
additional START = 1 slots which as just described, have a transient 
effect on the operation of the link. 

Consider the condition where a START or END bit is changed from 
a1 toaQ0Q. A new cycle would fail to initiate. After a time-out greater 
than the longest permitted cycle time, the head station will issue a 
START = 1, and the link will continue to operate normally. Should a 
head or end station fail, the station next to the head or end station 
would assume the functions on detecting loss of timing or after timing 
out on the arrival of START = 1 or END = 1. 


3.4 Fault diagnosis 


The independent lines of the Fasnet link provide the opportunity to 
localize and mitigate some types of faults. Consider first that a line is 
severed because of some catastrophic event or something trivial like a 
cable connector failing. The result will usually be either a short or 
open circuit leading to a gross impedance mismatch. The fault will 
most likely terminate all effective communication on the upstream 
side of the fault because of reflections from the mismatch traveling 
back into station interface units via the read tap. The downstream 
segment will be affected very little because the directional couplers 
will propagate little energy in the direction of the mismatch. Thus, a 
diagnostic program in the end station can determine between which 
stations the mismatch lies. This is done by having the end station send 
a query to each station via the intact line and determining which 
stations respond to the query. 

A difficult type of station fault is to have a station continuously 
write garbage on a line. Diagnostic programs in the end stations, again 
by querying each station, can determine the faulty station and remove 
it from service. The head station on the line with the fault, after being 
informed of the fault by the end station, via the other line, queries 
each station in turn. If the station fault is confined only to the write 
circuit, the faulty station will respond. The next station on the down- 
stream side will not respond, since it will not be able to read the query 
sent by the head station because of the interference from the faulty 
station. If both read and write circuits in the faulty station are affected, 
the last correctly responding station will be the station on the upstream 
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side of the faulty station. Thus, the fault is isolated to one of two 
stations. Both stations may then be disconnected by means of a control 
signal sent via the functioning line. The faulty station may then be 
uniquely determined by returning one of the stations to service. If the 
fault condition resumes, the returned station is faulty and is discon- 
nected; otherwise the other station is faulty. 


IV. PERFORMANCE 
4.1 Sample operation 


Typical operation of Fasnet for lines of 2.5-km individual length, 100 
Mb/s bandwidth, and 200-bit frame length is shown in Fig. 5. It shows 
the time-space relation of the frames on each line. The horizontal axis 
represents time divided into slots Aj, Az, As, --- for line A and By, Bo, 
Bs, -+- for line B. The vertical axis represents the physical locations 
of the active stations S|, S2, S3, S,;, and S; with S; and S; serving, 
additionally, as end stations. The electrical line length is five frames. 
Station S; initiates the cycle in frame A;, and access passes from S; to 
S2 to S3 to S,. When the end station, S;, senses BUSY = 0 in frame As, 
it sets END = 1 in frame By. Receipt of this frame by S; causes it to 


FRAMES 8, B2 B3 


° gm 


r" 


T  £Oee 


A\A 


LINE B 





S3 EA\V 
54 LK\\F 
Zz 
° 
os 
fe) Ss 
a 
z 
(3) S5 G 
> 
ES 
a S4 e 
S3 ae 
LINEA 
S2 
Sy 
FRAMES A, Ag At Ais Ais Az 


TIME IN SLOTS 


Fig. 5—A graph of activity on a Fasnet link (lines A and B) as a function of time. The 
dotted lines indicate the flow of information from one line to another. 
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initiate a new cycle in Ay. Similarly, a cycle on line B starts at B.. 
Assume that S; and S; are permitted access for up to two and three 
packets, respectively. Station S; senses BUSY = 0 in frame Bg and 
sets END = 1 in Ais. Receipt of this frame by Ss causes it to start a 
new cycle in B;7. 


4.2 Utilization 


As there are no collisions, no capacity is lost through collision 
resolution. However, the utilization is not 100 percent as each line is 
idle at the end of each cycle. The idle period is nine frames, W,, We, 
-++ Wa, on each line in Fig. 4. In the worst case, this is equal to twice 
the end-end propagation delay, plus twice the slot time (one slot time 
on average) as each end station has to wait until the start of the next 
slot to set the START or END bits. If 


v = speed of propagation on the line (m/s) 
W = line capacity (b/s) 
L = line length (m) 
F = frame size (bits) 
M = number of busy stations with downstream traffic, 
then if each station is allowed access for only a single packet per cycle, 
cycle length = 7, =M*(F/W) + 2*(L/v) + (F/W), 
duration of busy frames = 7, = M*(F/W) , (1) 


and 


eee M*(F/W) 

nen TMF W) + 2*(L/v) + (F/W) | 

The effective utilization is lower since a fraction of each frame is 

devoted to access control. However, for large F’, it results in only a 

small reduction in utilization. If vy = 2.5 x 10° m/s, W = 100 x 10° b/s, 
L = 2.5 X 10° m, 


(2) 


M*F 
1 (1 + 1*F + 2000" 
For M = 100, 
F= 50 7 = 71 percent 
F= 100 7 = 83 percent 
F = 200 7 = 90 percent 
F = 500 7 = 95 percent 
F = 1000 7 = 97 percent. 
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For the same values of v, W, and L, with F = 500'” and assuming an 
Ethernet slot time T = 50 X 107° s,? we can compare the performance 
of Fasnet and Ethernet as the number of stations is varied as shown 
in Table II. 

Unlike Ethernet, Fasnet has the desirable feature that as the load 
increases, the utilization also increases. The above figures do not 
reflect the fact that in practice the length of packets is variable and, 
consequently, the fixed frames of Fasnet frequently will be only par- 
tially filled. The effect on 7 depends on the distribution of packet size, 
and to some extent is determined by the system design. For example, 
in a system designed for large amounts of voice traffic, F could be set 
equal to the size of a voice packet. 


V. IMPLEMENTATION CONSIDERATIONS 


The design criteria previously stressed in the introduction affect the 
implementation in important ways. In particular, the requirement to 
operate at high speeds and the unidirectional operation of the bus 
affect the design of the synchronization system; in turn, the type of 
synchronization and the use of directional couplers impact the choice 
of the line code that is used. 


§.1 Synchronization 


Bus systems in which signals travel in both directions on the line 
require the receiving stations to adapt to the signals transmitted by 
the sending station because the amplitude, dispersion, and phasing of 
the received signal vary depending upon the position of the transmit- 
ting station on the line. Synchronization can be achieved very quickly 
when the signaling rate is low relative to the bandwidth of the trans- 
mission medium. At higher signaling rates, synchronization needs to 


Table Il—Fasnet versus 

Ethernet as a function of 

number of busy stations 
Fasnet Ethernet* 


M (in percent) 
5 50 4.1 
10 67 3.9 
50 91 3.7 
100 95 3.7 


* Note that since the minimum 
permissible packet length is 5000 
bits, 7 1s calculated as 0.1 of y with 
5000-bit packets. Other csMa pro- 
tocols that do not require collision 
detection perform better. 
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be more accurate to achieve good error performance. Ethernet specifies 
a synchronization preamble of 64 bits and for higher transmission rates 
an even longer sequence may be required. Thus, for short messages 
efficiency would be significantly reduced. Using a unidirectional bus, 
each station can be synchronized to a common clock issued from the 
head station. Thus, if all stations add signals to the cable in phase with 
the transmitting clock, stations will receive the signals in correct phase. 
Similarly, fixed gain and frequency compensation can be employed. 
The problem of reliability can be overcome by giving each station the 
ability to supply clock. The clock drive would be inhibited by detection 
of, and locking to, an incoming clock. 

Initial tests have shown that a simple, cost-effective method of 
synchronization is to synchronize to a continuously injected pilot tone 
placed at the high end of the signaling band. The synchronizing 
function then assumes a negligible fraction of the transmission ca- 
pacity. 

In addition to bit synchronization, frame synchronization is also 
required. This is achieved by sending periodically a synchronizing bit 
pattern. Design is simplified if this is sent after an integral number of 
frames, say 64 or 128. With tight bit and frame synchronization, 
successive frames may be butted together without a gap. 


§.2 Signaling 


Because synchronization is achieved independently of the data 
signal, line codes with fewer transitions may be considered. It is 
particularly convenient if a code is chosen that couples no energy to 
the line when one of the logic states is continuously transmitted 
(assume logic 0). Each station at the end of transmission then simply 
returns to logic 0, and there is no need to “disconnect” the transmitter 
from the line. The two line codes we are investigating are a bipolar 
three-level code, Fig. 6a, and a nonreturn to zero (NRZ) two-level code, 
Fig. 6b. The two-level signal has a greater noise margin; however, one 
has to contend with the dc signal component.” 

The Fasnet protocol does not permit subsequent stations to modify 
a signal already transmitted by an upstream source. In principle, this 
could be done. For example, a signal from one station could be deleted 
from the line by a second station writing the complement on the line. 
In practice, signal levels would have to be matched very accurately for 
such a scheme to work. 

There is one condition in which more than one station may add 
energy to the same bit in a frame—the BUSY bit of the Ac field. Asa 
result, the amplitude of this bit may far exceed the amplitude of the 
remaining signal. This may lead to errors in adjacent bits of the Ac 
field. To prevent this, guard bands on either side of the BUSY bit 
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Fig. 6—(a) Bipolar three-level line code. (b) NRZ two-level line code. 


should be used. Notice from Fig. 3 that the access field is configured 
so that the guard bands fall in the intervals tae- and, in practice, will 
have a comparable duration. 


Vi. IMPROVING UTILIZATION 


As can be seen from (1), efficiency increases (i) as cycle length 
increases and (iz) as the idle period at the end of each cycle (intercycle 
gap) decreases. At the expense of some increase in complexity, tech- 
niques may be devised to improve utilization by increasing cycle length 
or reducing intercycle gap. 


6.1 Control of cycle length 


Since START = 1 may be read by all stations, the length of the last 
cycle, 7-, may be determined by any station. As previously described, 
each station may transmit up to Pmax packets per access. Thus, by 
controlling pmax, stations may influence the value of r,. Station control 
of rt. by manipulation of Dmax is obviously limited. For example, let us 
assume that p is fixed at 1 and that we have stations each generating 
packets at a rate < 1/7,. Increasing p will not change the cycle length 
since packets will be transmitted before a queue can form. On the 
other hand, increasing p for heavily loaded stations will lead to an 
increase of t., provided 7, is less than the accepted maximum. 


6.2 Reducing inter-cycle gap 


Three methods are described for reducing the intercycle gap and 
hence increasing the line utilization. In the first, stations detecting the 
END = 1 bit seize empty slots on the other line; in the second, stations 
use the END field as a request field; in the third, the end station 
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Fig. 7—A graph of activity on a Fasnet link as a function of time. Empty slots 
encountered on line A by stations that have read END = 1 on line B are utilized. The 


IDLE period is reduced to three slots. Corresponding use of IDLE slots on line B would 
also occur but is not shown. 


attempts to estimate the end of a cycle, setting END = 1 before BUSY 
= 0 is received. 

Considering the first method, any station S; in the WAIT state that 
observes END = 1 may attempt to seize any empty slots on the 
opposite line.” The number of empty slots seized depends on the time 
the END = I frame takes to propagate to the next active station, 
which then seizes empty slots, thus preempting active stations down- 
stream.* The intercycle gap now depends on the propagation time 
from the last active station to the end station and back. (The relative 
timing of the frame starts in the two lines will also affect the gap size). 
As shown in the example of Fig. 7, the intercycle gap has been reduced 
from nine slots to three, shown for line A only. 

In the second method, stations in DEFER and ACCESS states write 
REQUEST (REQ) = 1 on the return line (the END field is now 


* A station may only transmit a single frame at a time because of the possibility of 
preemption. This will interfere with construction of “superpackets.” Superpackets are 
used to increase efficiency by reducing the effect of packet overhead. The overhead is 
attached only to the first packet of the superpacket.!” 
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replaced by an REQ field). After all stations have been served, the 
head station will read REQ = 0 and initiate a cycle. To ensure that at 
least one REQ = 1 occurs in a cycle for which only one station is 
active, a station in changing from WAIT to DEFER or ACCESS 
writes at least one REQ = 1. Approximately speaking, the average 
intercycle gap is now equal to twice the propagation time from the 
head station to the last active station, plus half a slot time. Notice that 
this procedure is more distributed in that the end-station function of 
recognizing the end-of-cycle END condition and writing END = 1 on 
the return line is now bypassed. Each station now performs an equiv- 
alent operation. 

A further refinement is to observe that the head-station function 
can also be distributed.* Each station with traffic to transmit need not 
wait for the head station to issue START = 1. Rather, after reading 
REQ = 0 on the return line, it can switch from WAIT to DEFER or 
ACCESS mode setting the p register to the allowed number of packets 
that may be transmitted per cycle. The intercycle gap is virtually 
eliminated under heavy traffic conditions and for a large number of 
users as in the first method. For two continuously queued stations, the 
intercycle gap is twice the delay time between the stations, plus one 
slot time on average. Notice that even though no START is being 
issued, or is necessary, the loop cycles. However, as in the first method, 
packets from a source during one cycle will usually not be consecutive. 
The station protocol is summarized in Fig. 8 by means of the state 
diagram. As seen in comparison with Fig. 4, the protocol is more 
complex; however, the algorithm no longer requires the centralized 
head or end-station functions. The issue of distributed control versus 
centralized control is particularly important in efficiently accommo- 
dating different types of traffic and is discussed further in Section VII. 

In the third method, each active station in the DEFER state 
indicates its desire to transmit by setting an additional request field to 
1 (REQ = 1) in the access field of the line on which it wishes to write, 
as in Ref. 12. The end station estimates the cycle length and transmits 
END = 1 timed to arrive at the head station as the last slot to be used 
in the cycle is leaving the head station. If the estimate was too low, the 
end station will read REQ = 1 in the last frame in the cycle indicating 
that the estimate of the length of the last cycle was too short. 
Therefore, the estimate of the length of the next cycle would be 
increased. If the estimate is too high, there will be empty slots prior to 
the arrival of the next START = 1, and the estimate of the length of 
the next cycle would be decreased. If the estimate is correct, then the 


; * Suggested by Z. L. Budrikis in connection with the earlier single-line version of 
asnet. 
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Fig. 8—State transition diagram of a distributed version of the Fasnet protocol in 
which unique functions in the head and end stations are not employed. Statements in 
parentheses are actions executed on making the transition. The filled circles denote 
intermediate states. 


last frame before the next START = 1 will have REQ = 0 and the 
estimate of cycle length would remain unchanged. 

Of the three methods for reducing intercycle gap, the former two 
are distributed while the latter requires additional intelligence in the 
end station. 


Vil. TRAFFIC CONTROL 


Different types of traffic have very different transmission require- 
ments. Voice traffic is an interesting example of where an initial 
restriction of access (blocking) is preferable to losing voice packets. 
Real-time traffic such as voice and video requires a guaranteed maxi- 
mum delay if information is not to be lost. In contrast, some types of 
computer traffic are rather tolerant of delay. It is not our aim to 
develop here a comprehensive algorithm for handling different traffic 
types; it would depend closely on the particular environment and the 
mix of traffic. However, we would like to indicate the mechanisms that 
Fasnet can support to control integration of traffic. 
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We can identify four different types of control mechanisms. 

(t) Selection of traffic—The ability to individually control a class 
of traffic. 

(tt) Request for access—The ability to communicate that service 
is required. 

(tit) Blocking of traffic—The ability to prevent traffic of a specific 
class from gaining access to the system. 

(iv) Continuation of service—The ability to delay traffic for later 
transmission. We will take these four control mechanisms in turn and 
consider how they may be implemented in Fasnet. 

(t) Selection—We can borrow from the strategy used by Frata et 
al. and Ulug et al.° START, instead of being a single bit, can be 
expanded to a multibit word. A START code can then be allocated to 
each class of traffic. A station would then be permitted to transmit 
only if the START corresponded to the class of traffic it is waiting to 
transmit. A class of traffic could denote a traffic type, as well as a 
priority. The term subcycle will be used to denote the period from a 
START of one class to the next occurring START. Cycle will be 
reserved for the period between two STARTs of the same class. 

(it) Request—The amount of request information can vary from 
knowing exactly which station wants to send what traffic in one 
extreme to knowing just that a station somewhere wants to send some 
type of traffic in the other extreme. A compromise would be to provide 
a request word adjacent to the END field in each packet on the return 
line. Each bit in the request word would denote a class of traffic. This 
information would enable the head station to determine that one or 
more stations required service of a particular type without indicating 
the extent of the demand. Information about demand for service would 
most likely be used to adapt the control strategy in the case where 
there was a change in the balance between traffic types. 

(iit) Blocking—The channel capacity allocated to a class can be 
controlled by the time allocated to that class as suggested in Frata et 
al.’ By issuing a different START after a given period, further traffic 
of the original class type wishing to transmit would be denied access. 
Blocking is typically used to handle overflow of the type of traffic that 
generates information periodically, such as real-time voice and video, 
and synchronous data traffic. In this instance, it is appropriate to 
speak of connections between source and destination which implies 
guaranteed access once a connection has been allocated. 

To discuss the allocation of connections in a blocking traffic class 
requires that the previous definitions of WAIT state and ACCESS 
state be generalized: 


WAIT—A station is waiting for permission to seek access to the line. 


FASNET 1431 


ACCESS—A station has a connection. 

Stations that already have access take the first free slot available to 
them after the appropriate START. Allocation of freed up slots on a 
reasonably equitable basis would proceed as follows. Stations would 
be aware of say n slots becoming free from the position of the END bit 
on the return line. Note, no END is issued if the class is full. Stations 
in the DEFER state would be permitted to compete for the n empty 
slots at the end of the subcycle. This will favor stations close to the 
head end. However, a large degree of fairness is achieved by permitting 
stations to switch from the WAIT to DEFER state only when two 
consecutive ENDS are encountered for that subcycle. This will only 
occur when all traffic currently in DEFER state has been granted 
access. At this point, traffic in the WAIT state would switch to DEFER 
and then vie for empty slots as they become available. This strategy 
is related to the snapshot algorithm.”® 

(tv) Continuation—In contrast to blocking, continuation requires 
that traffic not able to access the link in the previous cycle be served 
before any new traffic is accommodated. This may be achieved in the 
following manner. Assume that the class type is non-blocking. If the 
head station should issue a new START before all traffic of the class 
has been served, the end station will not detect the end of the cycle 
and hence will not issue END = 1. The absence of an END = 1 would 
indicate to the head station that the p registers of the stations in that 
class should not be reset on the next cycle (i.e., the stations would not 
switch from WAIT to DEFER).” Thus, in the following cycle, remain- 
ing traffic would be served. For centralized control, the START for 
this traffic type could contain an additional bit to indicate whether the 
previous cycle is being continued for deferring traffic or a fresh cycle 
is being started for new traffic. For distributed control, each station 
could keep track of the sequence of START's and ENDs. 

It is important that the control strategy be adaptive to changing 
traffic conditions. We expect that the traffic mix will change relatively 
slowly—over a period of seconds rather than ms. Thus, it would be 
feasible to have the adaptation achieved by a server process. 

The control algorithm could be implemented as completely distrib- 
uted, completely centralized, or somewhere in between. Economics 
and reliability will dictate, to a large extent, where the control should 
be placed. Nevertheless, a hybrid strategy would seem more in the 
spirit of the current design. For example, selection is probably best 
achieved by having the head station transmit the appropriate START 
code (centralized, but perhaps assumable), whereas traffic assignment 
and continuation is probably best achieved by having each station 
read and operate on the END field (distributed). 
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Vill. TOPOLOGY 


8.1 Introduction 


A population of stations may be connected together by either a 
single link (Fig. 9a) or by several interconnected links (Fig. 9b). The 
best topology will depend upon physical distribution, traffic patterns, 
and the particular performance measures that one seeks to optimize. 
We will not consider the general problem, but restrict ourselves to the 
linear interconnection of links forming closed loops. Fasnets may be 
connected as shown in Fig. 10. Packets in Fasnet 1 destined for Fasnet 
2 are addressed to station Sy. Station Sy transmits the packets to 
station S; which puts them on Fasnet 2. Similarly, for packets from 
Fasnet 2 destined for Fasnet 1. To provide reliability against single- 
station failures, interconnection stations would be provided in pairs. 
Thus, a similar connection would be provided between Sy-; and Sz. A 
detailed procedure may be specified whereby control of the intercon- 
nection passes from the Sy — S; connection to the Sy-1 — S2 connection 
in case of failure of the former. In principle, the secondary connection 
monitors both Fasnets and assumes the interconnection function after 
a suitable time-out period in event of failure. Provision can also be 
made for the primary connection to periodically check that the sec- 
ondary connection is operational. 


STATIONS 
a 


a 








SINGLE 
FASNET LINK 
/ 





(a) (b) 


Fig. 9—(a) A cluster of stations being served by a single Fasnet link. (b) The same 
station population being served by several interconnected links. 
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FASNET 1 


LINE A» 
FASNET 2 


| LINE Bp 


Fig. 10—Structure of a connector used to interconnect two Fasnet links. 


Interconnection of Fasnets as shown in Fig. 10 permits traffic to pass 
from one link to another with a minimum of delay. Since the connec- 
tion is to the first station on the link, the incoming packet can utilize 
the next occurring slot. Because of differences in frame timing between 
two links, it may be necessary to buffer a maximum of one complete 
packet; this amount of buffering is normally provided in a station 
interface. In general, interconnection of more than two Fasnets will 
require larger buffers to be employed to handle the condition where 
traffic arrives simultaneously for one Fasnet from connecting Fasnets. 


8.2 Traffic localization 


If stations on a Fasnet have traffic destined only for stations in the 
- immediate vicinity, then total utilization can be significantly improved 
by dividing the single link into separate links that are connected. Only 
traffic that has not reached its destination link is allowed to cross the 
connector. 

Consider two Fasnet links connected to form a ring with an inner 
and outer loop as shown in Fig. 11. The two connectors transfer traffic 
from one link to the other. 

Assume that the ring has a length of unity and that the first link has 
a length / where / S ¥. For a given packet, let d be the distance of the 
destination station from the source station. This distance is measured 
along the ring with the anticlockwise direction as positive, the clock- 
wise direction as negative, and the source station as the origin. Assume 
that d is a random variable with a uniform distribution over [—%, 4]. 
When de[—%, 0], the source station accesses the outer loop; when 
de[0, 2], the source station accesses the inner loop. Thus, the source 
station selects the shortest distance to the destination along the ring. 
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Fig. 11—Two Fasnet links interconnected to form a link using two connectors. 


Depending on 7 and d, the traffic to any destination may have to 


traverse zero, one, or two connectors before it can be removed. 


We first wish to determine the probabilities of the above events 
when the stations are distributed uniformly on the ring. For station S; 
selected at random on link j(7 = 1, 2), let 


p; = probability that the destination is on the inner loop, and the 
same link, and the traffic traverses zero connectors, 
q,; = probability that the destination is on the inner loop, but the 
other link, and the traffic traverses one connector, 
r; = probability that the destination is on the inner loop, and the 
same link, but the traffic traverses two connectors (7; ¥ 0 only 
if the length of the link is > and the shortest path between 
two stations at the ends of the link is through the other link). 


It can easily be shown that 
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Note that p; + g; +7; = %, (j = 1, 2) as any station accesses the inner 
loop with probability 4%. Now suppose that there are N, active stations 
on link J and N> active stations on link 2. If each active station requires 
unit capacity, then the average traffic T;(i = 1, 2) in units of capacity 
on the inner loop in links 1 and 2 is given by 


Ti = Ni(pit qi tri) + No(ge + re) + Nini (4) 

T2 = No( pe + G2 + re) + Ni(qi + 71) + Nore 
because the traffic on a line in any link is the sum of three components: 
(¢) all the traffic generated in that link, (i) that fraction of the traffic 
generated in the other link that traverses this link, and (iii) that 
fraction of the traffic generated in this link that transverses the other 
link and then returns to the first link. If N, and Ne are very large, then 
by the law of large numbers we have 


traffic on link 1 = 7, =C 


traffic on link 2 = T2 = C, 


where C is the line capacity. 
It can be shown that N, + Ne is maximum when 7 = % and 


(5) 


M+ M=5C. 


If no connectors were used, we have a single Fasnet link (it can no 
longer be a closed ring) for which 


N, + Ne = 2C. 


Hence the gain G = (8C/3)/2C = 4/3. Thus, we are able to obtain a 33 
percent increase in the effective network capacity even for uniformly 
distributed traffic by having two diametrically located connectors.* 
We now extend the above analysis to the case of K connectors Ci, C2 
-++ Cx, which are located symmetrically around the ring as shown in 
Fig. 12. 
It can be shown by similar means that 


4 
re 

te 
K 


G= (6) 





This is plotted in Fig. 13. 

We then extend the same analysis to the case with K symmetric 
connectors Ci, C2 --- Cx, but with an arbitrary traffic distribution 
symmetric about the source station and extending from —% to +% 


* While this analysis does not pertain to a specific access protocol, the effective gain 
can be closely realized by the Fasnet protocol and the connector structure of Fig. 10. 
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Fig. 12—A Fasnet link with multiple connectors. 
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Fig. 13—Plot of the gain in traffic handling capacity of interconnected links relative 
to a single link as a function of K, the number of connectors. 


along the ring. This preserves the shortest path routing, as well as 
sharing the load equally between the two loops. 
It can be shown by similar means that for large K 


kK 1 
1+KD 1. |’ 
=+D 
K 
where D is the expected value of |d|, the absolute value of the source- 
destination distance. The approximation becomes exact if the distri- 
bution is uniform or if K — o. Note that for the uniform distribution, 
D = % and 


Ga (7) 
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as before. For a given traffic distribution, (7) is a good design formula 
for how capacity costs can be reduced at the cost of extra connectors. 
However, this trade-off is useful only if accurate estimates of capacity 
and connector costs are available. 

The above analysis highlights certain interesting features. For the 
uniform traffic distribution, it is seen that the gain, G, does not increase 
uniformly with K. This is fairly intuitive. Since the traffic is uniform, 
an extra connector, when K is large, removes the traffic over only a 
short link and results in only a marginal increase in the gain. On the 
other hand, as the traffic distribution becomes more localized (i.e., 
D — 0)G increases uniformly with K. This is again fairly obvious, as 
with a high degree of localization the traffic on each link is almost 
independent of the traffic on the other links. 

We have considered here some configurations of interconnected 
links. There are interesting graph theoretic questions relating to reli- 
ability. For example, given a graph like Fig. 9b, what is the minimum 
number of additional links and their position so that full connectivity 
is still maintained if any link is cut at a single point? Development of 
realistic models of the physical traffic and cost structures of local 
environments still remains. There is a paucity of statistics (except for 
Ref. 2) on the local network parameters. Future operational local 
networks will hopefully furnish statistics on which to build more 
accurate models. 


IX. CONCLUSION 


The physical configuration of Fasnet consists of two communication 
lines passing each station. One line carries traffic in one direction, 
while the other line carries traffic in the opposite direction. Thus, this 
configuration carries twice the traffic of a previous system in which 
the two lines were connected at one end so that traffic was written on 
the outbound line and read from the inbound line. Each station makes 
two connections to each line, a nondirectional read tap and a direc- 
tional write tap. Reliability of the physical medium is high because it 
contains no active electronics. The access protocol is partly centralized 
in that bit synchronization, framing, and start-of-cycle are provided by 
the end stations; however, these functions would be assumable by any 
station upon failure of an end station. 

The access protocol is as follows: Upon reading a start-of-cycle, a 
station may transmit a prespecified number of packets in the first 
available empty slots. When all stations have transmitted their pack- 
ets, a signal is sent on the return line to inform the head station to 
start a new cycle. The efficiency of Fasnet increases as the length of a 
cycle increases; cycle length depends upon the length of a packet, the 
number of active stations, and the number of packets, Pmax, that each 
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station is permitted to send in a cycle. By adaptively changing Dmax, 
efficiency can be maintained at a high level even for a small number 
of active stations. A number of techniques for further improving 
efficiency are suggested. A trade-off is necessary between increasing 
the complexity of the protocol, on one hand, and the resulting small 
improvements in efficiency on the other. 

Bit synchronization of the stations is achieved through adding an 
out-of-band pilot tone, while framing is achieved through a periodically 
inserted code word. A three-level bipolar line code is preferred. 

The potential of Fasnet for operation at high transmission rates 
makes it attractive as a conduit for the various types of traffic that 
may flow in a business environment. Mechanisms have been proposed 
to implement blocking, delaying, and request-for-service operations 
that are needed if mixed traffic is to be handled efficiently within a 
single medium. These operations can be implemented centrally or they 
can be distributed. The low-level access operations are best distributed 
while the more complex operations are best centralized. 

Fasnets may be interconnected to increase the load that may be 
carried or to improve reliability. Investigation has been restricted to 
the connection of Fasnets to form a ring. As the number of segments 
in the ring increases, the throughput first increases rapidly (assuming 
uniformly distributed traffic). After about five segments, the increase 
is very small. Exploration of other topologies presents a challenge. 
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On the Distribution Function and Moments of 
Power Sums With Log-Normal Components 


By S. C. SCHWARTZ* and Y. S. YEH 
(Manuscript received July 20, 1981) 


An approximate technique is presented for the evaluation of the 
mean and variance of the power sums with log-normal components. 
Exact expressions for the moments with two components are devel- 
oped and then used in a nested fashion to obtain the moments of the 
desired sum. The results indicate more accurate estimates of these 
quantities over a wider range of individual component variances 
than any previously reported procedure. Coupling our estimates with 
the Gaussian assumption for the power sum provides a characteri- 
zation of the cumulative distribution function which agrees remark- 
ably well with a Monte Carlo simulation in the 1 to 99 percent range 
of the variate. Simple polynomial expressions obtained for the mo- 
ments lead to an effective analytical tool for various system perform- 
ance studies. They allow quick and accurate calculation of quantities 
such as cochannel interference caused by shadowing in mobile te- 
lephony. 


Il. INTRODUCTION 


The power sum with K independent components 
K 
Pr = 10 loo] Ds 108" (1) 
k=l 


is a random variable which appears in many areas of communications. 
With X, Gaussian, the quantity 


Ly = 10%" (2) 


is called a log-normal variate. The characterization of the sum of L; is 


* Professor Schwartz is currently with the Department of Electrical Engineering and 
Computer Science, Princeton University. 
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of importance in multihop scatter systems,’ log-normal shadowing 
environments,”” target detection in clutter,*” and the general problem 
of propagation through a turbulent medium.* Thus, the distribution 
and moments of Px are quantities of considerable importance. Unfor- 
tunately, these quantities do not appear to be expressible in simple 
analytical formulae and, as a consequence, approximate procedures 
have been investigated for some time. Of particular interest is the 
Wilkinson approach which uses a normal approximation for the distri- 
bution of Px. The problem of characterizing the distribution function 
then reduces to finding the first two moments of the power sum. 

The Wilkinson approach is consistent with an accumulated body of 
evidence indicating that, for the values of K that are of interest, the 
distribution of the sum of a finite number of log-normal random 
variables is well-approximated, at least to first-order, by another log- 
normal distribution.’*""! The central question, then, is how to estimate 
the mean and variance of the approximately Gaussian variate Pr, i.e., 
the power sum. Knowledge of the mean, m,, and standard deviation, 
o,, of the Gaussian random variable X; leads to a complete specification 
of the log-normal variate L;, assuming, of course, no location param- 
eter for the log-normal. Yet, the procedure for then estimating the 
moments of the power sum variable 


Pxr= 10 logio(L) = 10 logio(Li +DLot+---+ Dx) (3) 


based on the individual moments m,, o,, is by no means straightfor- 
ward. As discussed in the next section, the Wilkinson approach leads 
to useful results only for a limited range of small values of the dB 
spread o,. Unfortunately, this is not the o, range of most practical 
interest. 

The purpose of this paper is to outline a procedure that appears to 
be more accurate and to have a wider range of applicability than 
previously reported approximations. The new technique can be easily 
summarized: Analytical formulae are developed to compute the exact 
mean and variance for the power sum with two components, 


P, = 10 log(Z; + Lz). (4) 


We then suppose P> is Gaussian, i.e., it is assumed that LZ; + Le = 
1072/° is log-normal. We then consider 


P; = 10 log(L; + Lz + Ls) = 10 log(1072" + Ls) (5) 
and compute the mean and variance using the derived formulae. In 


this manner, we iterate until the required moments of Px are obtained. 


* Many more applications, including the field of economics and the estimation of 
crude oil reserves, are noted in Refs. 6, 7, and 8. 
t We also confirm this observation here by means of Monte Carlo simulations. 
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We have compared the results of a Monte Carlo study to the 
estimates of moments obtained from the new method just described, 
and permuted versions of it. Our procedure leads to remarkably 
accurate estimates for the mean of Px, over a wide range of component 
variances. The estimate of the standard deviation of Px is accurate for 
a somewhat more limited range but, nevertheless, a much wider range 
than for previously reported approximations. 

The rest of this paper is organized as follows. In the next section, we 
establish additional notation and discuss, in more detail, previous work 
relating to power sums. The analytical development is presented in 
Section III, with the details reserved for the Appendix. The develop- 
ment of both accurate and simple expressions for the mean and 
variance of P2 is of considerable importance for system performance 
studies in areas such as shadowing in mobile radio and log-normal 
fading in microwave radio. In Section III, we also present such an 
expression in the form of simple polynomials. Simulation results, 
validating both the Gaussian approximation for Px and the new 
estimation technique, are presented in Section IV. Concluding com- 
ments are given in Section V. 


il. PRELIMINARIES 


The power sum, as defined in (1), is measured in decibels. However, 
rather than use the definition for L; given in (2), it is more convenient 
to use the natural logarithm: 


Lr =e**. (6) 
The relationship between the two associated normal variates is simply 
Yr = AXz, (7) 

where 
A = “o logel0 = 0.23026. (8) 


When the mean and variance of X;, are specified as, say, mx and ox, in 
order to use the representation (6), we have the obvious scaling 


my=Amz, a = A’or. (9) 
Similarly, we define the quantities 
L = 10? = e”, (10) 
and the corresponding moments are related by 
m:=)mp,, 62 =D’o?,. (11) 
The rth moment of a log-normal variate L is given in terms of the 


moment-generating function of Y: 
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E(L‘) = E(e’™?) = e7M,41/2(r'o5,)_ (12) 


Note that the moments grow exponentially with the order r’. This 
rapid growth of moments may be one of the reasons for the limited 
range of applicability of the classical Wilkinson approximation (see 
Ref. 11); dealing with diffferences of large numbers can lead to numer- 
ical instability in trying to evaluate the approximate log-normal density 
of the sum. 

Marlow” has shown that under quite general conditions, as K > 
oo, Px is asymptotically normally distributed.* For fixed, finite K, and 
with the Xx independent, identically distributed normal random vari- 
ables (with mean m, and standard deviation o,), Marlow also derived 
a small variance normal approximation.” As ox, — 0, a scaled version 
of Px approaches the distribution function of a unit normal. 

In terms of log-normal variates, if we write 


Pr = 10 logio(L) = 10 logio(L1 + Le teen + Lx), (13) 


Marlow’s second result says that, in the finite-component, small-vari- 
ance case, the sum of log-normal variates is also approximately log- 
normal. This result forms the analytical basis for the classical Wil- 
kinson approximation, which is the usual normal approximation for 
the distribution of the power sum. 

The Wilkinson approximation proceeds as follows: With 


L = e%, (14) 


now taken to be log-normal, Z is the associated Gaussian random 
variable with parameters m, and o-. To find these quantities, one uses 
(12) and equates the first two moments of both sides of (14). For 
example, with K = 2, and Yj, and Y>2 identically distributed with mean 
my, and variance o%, 


E(L) = emtl/202 = JDoem,+1/20; 
E( L’) = e2m.t20: — Jp2m,+20; 4 2(em+1/205)? (15) 


Taking logarithms gives a set of linear equations for the two unknowns, 
m, and o. 

Figure 3, which will be discussed in detail later, presents the results 
of a Monte Carlo simulation and the Wilkinson approximation, em- 
ploying eq. (15). As indicated, the Wilkinson approach tends to break 
down for o, = o,/A greater than 4 dB. This is consistent with the 
above-quoted result from Marlow. 


*For the infinite component case, Marlow also points out that there is asymptotic 
normality both on the power scale, as well as on the dB scale. That is, both Px and L are 
asymptotically normal. 


1444 THE BELL SYSTEM TECHNICAL JOURNAL, SEPTEMBER 1982 


If one is interested in small o, but larger values of the random 
variable, i.e., the tails of the power sum distribution, Fenton’ suggests 
a better approximation to the equivalent log-normal distribution by 
utilizing higher-order moments. In this case, eq. (15) is replaced with 
equality of, say, third and fourth moments. This procedure of using 
different values of r in different regions results in an approximating 
distribution which would be a series of connected straight lines when 
plotted on log-probability graph paper. 

At the other end of the o, component range, Farley’ has derived a 
large variance approximation. With independent, identically distrib- 
uted variables, as 0, — © 


Pr| 2 (Px — mx) < “| = [®(a)]*, (16) 
where ®(a) is the unit normal distribution function. (Clearly, this is a 
decidedly non-Gaussian result.) Farley’s simulations indicate that the 
large variance approximation gives better results than Wilkinson’s 
approach for about o, > 10 dB and K fixed. 

In this study, we also take K finite, but focus on the midrange, 4 = 
0, = 12 dB, where neither Marlow’s nor Farley’s results are applicable. 
This midrange is of particular interest for the shadowing phenomena 
in mobile telephony, microwave radio fading, and airborne radar 
clutter. 


lll. ANALYTICAL RESULTS 


We assume that 
K K 
L=) Lk= ¥ e™* =e? (17) 


for finite K is log-normal. The associated Gaussian variate is Z, and 
the goal is to determine the quantities 


m, = E(in L) 
o: = E{(In L — m.)”] (18) 


in terms of the mean and standard deviation of the component variates 
Y:. 

Below, we give exact expressions for mz and o, when there are two 
components. The procedure for K greater than two combines the log- 
normal variates in a nested fashion two-by-two, using the exact for- 
mulas developed. To illustrate the procedure, suppose we have three 
log-normal variates: L = L,; + L2 + L3. The exact first and second 
moments of In(L; + Lz) are computed. We then take 


Zz = In(L; + Le) (19) 
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as a normal random variable and write* 
= In(L) = In(e” + L;) = In(e” + e*). (20) 


We then compute the moments of In(L), again using the exact expres- 
sions developed for two components. This procedure has been tested 
in a variety of situations and groupings of variates. The results are 
very encouraging and will be discussed in the next section. Here, we 
outline the development of the analytical formulae, with the details in 
the Appendix. 

Naus’” has derived the moment-generating function, and computed 
the first two moments of the power sum with two independent, 
identically distributed normal components. (Hamdan”™ generalized 
Naus’ result for correlated variables with unequal variances.) Here, we 
extend Naus’ general approach to the case of two independent vari- 
ables with unequal means and variances. 

For the two components, we have 


f=L=1,+Ll,.=e' +e” (21) 
or 


Z = In(e™' + e*?). 


Define the Gaussian random variable 


w= Y2 = Y, (22) 
My = W= My, — My, (23) 
Ow = 5, + O5,. (24) 


Taking the expectation of Z, we have 
E{Z} = EfIn(e™' + e*)] 
= E{infe™'(1 + e’™)]} 
= E(Y;) + E{In(1 + e’)]. (25) 


The second term is 


E{In(1 + e”)] = | [In(1 + e”) ]f(w)dw, (26) 


where f(w) denotes the normal density with the above-indicated mean 
and variance. The logarithmic term is now expanded in a power series. 
To ensure convergence of both the power series and the series resulting 
from the subsequent integration, the integral is broken up into the 
appropriate ranges. Thus, 


* Equivalently, L,; + Lz is assumed to be log-normal. 
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pes 0 
[ In(1 + e”)f(w)dw = | In(1 + e”)f(w)dw 


+ | [In(1 + e~”) + w]f(w)dw. (27) 
0 ; 


The expansion 


00 : (-1)/} 
In(1 + x) = by Gx, C; = F ; (28) 


J=1 





valid for |x| < 1, can now be used for each of the above integrals. The 
result (details are given in the Appendix) for the first moment is 


m, = E{In(L; + Le) | 


Ow Ses Mwy 
Ss en Mi/ 26% + My o(“) 


= My, 
2a Ow 
= = ae ee 
age 2 Cet #2 em ofr A) 
k=) a 
w — Row 
+ eo bmw o(™— k )| 
Ow 
= My, + Gi(ou, Mw), (29) 


where ®(<x) is the Gaussian probability distribution function as defined 
in the Appendix, eq. (39), and G; is defined implicitly. It is not difficult 
to show that the series in (29) converges. Our computer experience 
indicates that about 40 terms are required for the 4th significant digit. 

As might be expected, the formula for the second moment is more 
complicated and is given by eq. (72) in the Appendix. The basic idea 
however, remains the same: When expansions of logarithmic functions 
are used, the expectation is broken up into integration ranges which 
provide convergent series. 

Although the formulae as developed involve infinite series and are 
complicated, their application is relatively simple. We can write [see 
eqs. (49) and (73) in the Appendix]: 


Mz = My, + Gi(Gw, Mw) 
oz = 05, — Gi(ow, Mw) — 29°G3(ow, Mw) + G2(ow, Mw); (30) 
where the Gi(o., mw) are defined in the Appendix and 
p = —0y,/Ow. (31) 


Observe that the G; functions, 1 = 1, 2, 3, depend only on the two 
parameters my, ow. Consequently, we can evaluate Gi{o., mw) as a 
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Table la—Coefficients of the approximating polynomials—Region | 


[-20 < my < 0, 0 <0, < 15] 


Functions 
Coefficients Gl G2 G3 
A00 —0.1153239E 00 0.4012876E-01 —0.3958699E 01 
AOl —0.5667912E 01 —0.4483259E 01 ~—0.5454983E 01 
A02 0.1151279E 02 0.7391760E 01 0.1139280E 02 
A03 —0.7162489E 01 —0.3772190E 01 —0.7116366E 01 
A04 0.1312986E 01 0.5262268E 00 0.1315218E 01 
Al10 —0.1611385E 00 —0.1579114E 01 0.6839918E 01 
All 0.2084215E 02 0.1637249E 02 0.1962529E 02 
Al2 —0.4499768E 02 —0.2994901E 02 —0.4314091E 02 
Al3 0.2756210E 02 0.1837812E 02 0.2647795E 02 
Al4 ~0.5109783E 01 —0.1747459E 01 —0.4940592E 01 
A20 0.13845124E 00 0.2174588E 01 —0.4717296E 01 
A21 —0.2670183E 02 —0.2114152E 02 —0.2486802E 02 
A22 0.5919191E 02 0.3976749E 02 0.5618572E 02 
A23 —0.3695334E 02 —0.1776890E 02 —0.3501815E 02 
A24 0.6912766E 01 0.2195322E 01 0.6560938E 01 
A30 0.8057054E-01° —0.7525302E 00 0.1819360E 01 
A381 0.1429709E 02 0.1147157E 02 0.1323538E 02 
A32 —0.3225969E 02 —0.2189501E 02 —0.3050022E 02 
A33 0.2055628E 02 0.1006767E 02 0.1937757E 02 
A34 —0.3888684E 01 —0.1224983E 01 —0.3658379E 01 
A40 —0.3145306E-01 0.8447987E-01 —0.2817493E 00 
A41 —0.2730047E 01 —0.2222839F 01 —0.2518272E 01 
A42 0.6244253E 01 0.4289557E 01 0.5893337E 01 
A43 —0.4048245E 01 ~0.2035757E 01 —0.3809492E 01 
A44 0.7746786E 00 0.2499568E 00 0.7260144E 00 


function of these two parameters and store the results in three lookup 
tables. An alternative is suggested by Fig. 5, which gives the three G;’s 
as a function of o, with m, as a parameter. Since the curves are 
smooth, we can fit low-order polynomials to them and use the resulting 
expressions in (30) for the evaluation of the moments m, and o,. The 
advantage of utilizing these analytical expressions in system perform- 
ance studies is clear and, indeed, is what we have implemented. 

A least-squares fit was performed to determine the polynomial 
coefficients. After some experimentation, we chose the polynomial 


J K 
logioGi(ou, Mw) = > >; Ap(iox? cr ae 
j=0 k=0 


t=1,2,3, (82) 


with J = K = 4. The coefficients A;,(z) were obtained for two regions 
of the parameter space. The results for Region I (-20 = m, = 0,0 
Ow = 15) and Region II (—40 = m, S —20, 0 S o, = 15) are presented 
in Tables Ia and Ib. Over these regions of interest, the maximum error 
produced by using the polynomial approximations was found to be 
about 1 percent as compared to the exact calculations based on eqs. 
(49) and (73). 

Before discussing the results of the simulation study, we digress to 
note how the analytical results can be extended. First, the development 
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Table Ib—Coefficients of the approximating polynomials—Region II 


[—40 < m, < —20, 0 <a,< 15] 


Functions 
Coefficients Gl G2 G3 
A00 —0.3354792E 03 0.8942737E 02 —0.3596955E 03 
AOl 0.5281230E 03 —0.1521976E 03 0.5576596E 03 
A02 —0.3085414E 03 0.9734654E 02 —0.3242909E 03 
A03 0.7895054E 02 —0.2744724E 02 0.8271772E 02 
A04 —0.7518427E 01 0.2756644E 01 —0.7863267E 01 
Al10 0.1354559E 04 —0.3688427E 03 0.1443269E 04 
All —0.2133838E 04 0.6287175E 03 —0.2250748E 04 
Al2 0.1245491E 04 —0.4077784E 03 0.13807118E 04 
Als —0.3191052E 03 0.1148104E 03 —0.3336339E 03 
Al4 0.3036738E 02 —0.1161187E 02 0.3167827E 02 
A20 —0.1865732E 04 0.4966108E 03 —0.1986007E 04 
A21 0.2941369E 04 —0.8514775E 03 0.3106047E 04 
A22 —0.1718295E 04 0.5574036E 03 —0.1804377E 04 
A23 0.4403198E 03 —0.1588548E 03 0.4603419E 03 
A24 —0.4188256E 02 0.1624075E 02 —0.4366067E 02 
A30 0.1059090E 04 —0.2684747E 03 0.1128759E 04 
A31 —0.1671264E 04 0.4638312E 03 —0.1768423E 04 
A32 0.9776851E 03 —0.3061915E 03 0.1028409E 04 
A33 —0.2507169E 03 0.8842364E 02 —0.2624284E 03 
A34 0.2384698E 02 —0.9152624E 01 0.2487490E 02 
A40 —0.2121206E 03 0.5077676E 02 —0.2264791E 03 
A4l 0.3350888E 03 —0.8833020E 02 0.3553536E 03 
A42 —0.1963238E 03 0.5876009E 02 —0.2069294E 03 
A438 0.5040234E 02 —0.1717538E 02 0.5284466E 02 
A44 —0.4795980E 01 0.1799387E 01 —0.5008795E 01 


is valid when the random variables Y,; and Y2 are correlated as well as 
having different means and variances. With a simple modification in 
eq. (24) to account for the correlation [and another in (52)], 


Ow = Oy, + dy, — P120y,0y»5 (33) 


the results remain unchanged. Furthermore, in principle, the under- 
lying component variables do not have to be Gaussian. The proviso, of 
course, is that the integration with the non-Gaussian f(w) be tractable, 
and it must fall off fast enough so that the resulting series is convergent. 
(Clearly, the components are no longer log-normal.) 

Finally, we observe that the results can be extended in yet another 
direction. By generalizing the procedure outlined above, exact formulae 
for moments of more than two components can be obtained. However, 
the resulting expressions are very complicated and, given the high 
accuracy of our new method, the added complexity does not appear to 
be warranted. 


IV. SIMULATION RESULTS 


We use the definitions introduced earlier: 


L;, = 10% (34) 
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K 
Pr = 10 logio(L) = 10 oso] y Ls 3 (35) 
k=1 


In the following discussion, by m, and o,, we shall mean the first 
moment and standard deviation of the random variable Px. The value 
of K should be clear from the context. We focused our Monte Carlo 
study on three issues: 

(t) How good is the assumption that L = L, + [2 + --- + Lx is 
approximately log-normal or, equivalently, how well is the cumulative 
distribution function (cdf) of Px = logio(L) described by a Gaussian 
cdf? Furthermore, how closely does the Gaussian cdf (based on the 
calculated m, and o,) match the true cdf of the power sum? 

(tt) How accurate is the new analytical method in estimating the 
resultant mean m, and standard deviation o, and for what range of dB 
spread o, and component number K is the technique accurate? 

(zit) Is the nested procedure we have described numerically robust, 
or are the estimates of m, and o, sensitive to the order in which we 
combine the log-normal components? 

The results of our Monte Carlo simulation confirmed the observation 
of a number of other investigators, that the cdf of Px is well approxi- 
mated by the Gaussian cdf, particularly in the range of practical 
interest.* Shown in Fig. 1 is the cdf of the sum of two log-normal 
variates with o, = 6, 10, and 14 dB. We note that the Monte Carlo 
simulation agrees quite closely with the assumed Gaussian cdf (based 
on the calculated m, and o,) in the range of 0.1 to 99 percent. Outside 
this range, the simulation values start to deviate slightly. The cdf of 
the sum of a large number of log-normal variates, discussed below in 
Examples 1 to 3, together with a fourth case (the sum of six equal 
components with m, = 0 dB and o, = 10 dB), are presented in Fig. 2. 
Here we observe excellent agreement between the calculated (as- 
sumed) and simulated cdf in the range 1 to 99 percent. 

Having confirmed the Gaussian approximation for Px and the fact 
that the calculated cdf closely represents the true cdf, we turn our 
attention to the second issue, estimating the moments m, and o, from 
the moments of the individual log-normal components. In Figs. 3a to 
3c, the number of identically distributed components, K, is held fixed 
and the component variance (dB spread) o? is varied. In all cases, the 
component mean m, is 0. Figure 3a illustrates the results for two 
components and serves to verify the Monte Carlo program. Our 
computed results coincide (as they should) with the simulation. In 
contrast, the Wilkinson approximation begins to give inaccurate results 


* The simulations typically had 10,000 sample points for each value quoted or 
indicated on the figures. 
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CALCULATED 
© MONTE CARLO SIMULATION 


DECIBELS 





0.05 0205 1 2 5 10 20 30 40 5060 70 80 90 95 98 99 99.8 99.99 
PERCENT PROBABILITY THAT AMPLITUDE < ORDINATE 


Fig. 1—Cumulative distribution function of sum of two log-normal variates with zero 
means and variances equal to 6, 10, and 14 dB. 


CALCULATED 
O MONTE CARLO SIMULATION 


DECIBELS 





0.05 02051 2 5 10 20 30 40 5060 70 8 90 95 98 99 99.8 99.99 
PERCENT PROBABILITY THAT AMPLITUDE < ORDINATE 


1. THREE VARIATES: m = 0, 0 =6,7,9.5. 
2. SIX EQUAL COMPONENTS: m = 0, o = 10. 


3. NINE COMPONENTS IN THREE GROUPS: GROUP 1, m = -10, O= 6; GROUP 2, m= -18, 
@ = 10; GROUP 3, m = -38, 0= 12. 


4. EIGHTEEN COMPONENTS IN THREE GROUPS: GROUP 1, m = 10, o= 10; GROUP 2, m = -2, 
0 = 10; GROUP 3, m = -8, a= 10. 


Fig. 2—Cumulative distribution function of the sum of a large number of log-normal 
variates corresponding to Examples 1 to 3 in Section IV plus an additional case. 
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Fig. 3—Mean and standard deviation of power sum with (a) two, (b) 5:x, and (c) ten 
log-normal comonents as a function of comopnent standard deviation. 


12 





Mp IN DECIBELS 


at about o, = 4 dB and becomes increasingly worse with increasing 
Ox. 

Figure 3b illustrates six components, and Fig. 3c illustrates ten. The 
dB spread ranges from 2 to 14 dB. Our technique gives remarkably 
accurate estimates of m,, while the estimate of o, is quite close for 
ox < 8 dB and then tends to underestimate the true standard deviation. 
By way of contrast, for these ranges and number of components, the 
Wilkinson approach gives gross inaccuracies and cannot be used. 

In Figs. 4a and 4b, we vary the number of components, set m, = 0, 
and hold the dB spread fixed. Again, our procedure gives very accurate 
estimates of the mean and, as the number of components increases, 
the variance is underestimated only slightly for o, = 6 dB (Fig. 4a) and 
somewhat more so when o, = 10 dB (Fig. 4b). For these cases, the 
error in the estimate of the variance as a function of the number of 
components is as shown in Table II. (There are negligible errors in 
estimates of the mean.) As indicated in the figures, the errors in the 
Wilkinson approach are substantial, ranging up to 74 percent when 
ox = 10 dB. 

It should be observed that the situation with equal mean components 
and large dB spread is probably a worst-case situation. Indeed, from 
the above table we see for K = 16, o, = 10 dB, that the estimate of o, 
is off by 12.8 percent. In contrast, in Example 3 below, we deal with 
K = 18 components, and the same dB spread. There, the error in the 
estimate is less—about 6 percent. The only substantial difference in 
the two situations is the unequal mean values. These comments 
notwithstanding, further study is warranted for this boundary of 
parameter values, i.e., where the number of equal mean components 
is high (K = 16) and the dB spread is identical and large (o, = 10 dB). 

Our technique was also tested on a number of examples with unequal 
components. We report on three that were chosen to illustrate the 
general applicability of the new method. 


Example 1—Three components, equal means, different variances: 
m,,=0dB, 1=1, 2, 3; Ox, =6, 6x,=7, ox, = 9.5 dB. 


The Monte Carlo simulation gave m, = 8.08, o, = 5.356. Our procedure 
provided the estimates m, = 8.05, op = 5.273, for a 0.03-dB error in the 
mean and 1.5 percent error in the variance. 


Example 2—Nine components, taken in homogeneous groups of three: 
Mz, = —38 dB; ox,=12dB, i=1,2,3 
mz, = —18 dB; ox,=10dB, i= 4, 5,6 
mz, = —10 dB; ox, =6dB, 1=7, 8,9. 
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Fig. 4—Mean and standard deviation of power sum with varying number of identically 
distributed components. (a) Component mean, m, = 0 and standard deviation, ox = 6 
dB. (b) Component standard deviation o, = 10 dB. 


Monte Carlo results gave m, = —0.61, 0, = 3.90, while our technique 
yielded m, = —0.6, 06, = 3.79. This gives an error of 0.01 dB in m, and 
2.8 percent in op. 

We now address the third issue raised at the beginning of this 
section. To show that the way in which we combine the variables has 
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Table Il—Error in estimate of variance 
No. of Components (K) 








Percent Error 2 4 8 16 32 
o, = 10dB 0.13 1.4 5.4 12.8 21.4 
o, =6dB 0.43 2.0 3.3 8.2 11.5 


little effect, we permuted the order in combining the nine components. 
The resultant mean estimate varied between —0.59 and —0.64 dB, 
while the standard deviation varied between 3.66 and 3.89 dB. This 
example—and a number of others studied—illustrates the desired 
numerical robustness of the procedure. 

Example 3—Eighteen components: This example would correspond to 
two tiers of interferers in a cellular mobile radio scheme. 


Components 1 to 6: mz,=10dB, o, = 10dB. 
Components 7 to 12: m, = —2dB, o, = 10 dB. 
Components 13 to 18: m, = —8dB, ox = 10dB. 


The Monte Carlo results are: m, = 27.07, o, = 4.54, and our technique 
gave mM, = 27.04, op = 4.26. The error is 0.03 dB in the mean estimate 
and 6.2 percent for op. 


V. CONCLUSIONS 


In this study, we have verified once again, as others have, that the 
sum of a moderate number of log-normal random variables is well 
approximated by another log-normal variate, especially in the cdf 
range of 1 to 99 percent. Perhaps more important, we have been able 
to relate the mean and variance of the resultant (Gaussian) power sum 
to the first two moments of the individual underlying Gaussian com- 
ponents. Our method is highly accurate in the range of parameter 
values of most practical interest. 

Based on our preliminary simulations, we may conclude that the 
analytical method presented in this paper is extremely accurate in 
evaluating the mean of the power sum for the complete range of 
parameter values investigated. This range was a dB spread of 2 = 
ox = 14 GB and up to K = 30 components. It is also accurate in 
estimating the standard deviation with up to eight equal log-normal 
variates when o; is less than 10 dB. Outside this range, the approxi- 
mation is slightly less accurate. 


APPENDIX 


Derivation of First and Second Moments 


Our purpose is to provide enough detail so that an interested reader 
can rederive the formulae for the first and second moments, as given 
below by eqs. (48) and (78). 
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The expansions we use are 
(—1)**! 
k 





In(1 +x) = ¥ Cyx*, |x|<1 and Cp= 
k=1 


In?(1 + x) = Yo dax*4, 
k=1 


9(—1)**! # 
Se ge 


<1 d b=——— 
|x| an ke hal fi 


(36) 


(37) 


The density function of a normal random variable with arbitrary mean 


and variance is defined by 





| er ee 
®’ (x; m, 0) = ge 
270 


(38) 


The distribution function is a simple parameter function and is given 


by 


samme 


o(? —") = [ ; ®’(t; m, o)dt = = "Pat. 


The function ®(x) is related to the error function by 





x 


2 
erf(x) = edt 
T Jo 


D(x) = 4 + % erf(x/V2). 


Three integrals which occur often in the development are 


+00 
E{e™} = i e*®' (x; m, o)dx = e™*7"0?/? 


F —m—or 
| e”*@’ ( x3; m, o) dx = [ grninyp( mao) 


co se = o 
| e-"*@'(x; m, o)dx = [e7""*" °’ 0("="*), 
0 


We repeat the definitions introduced in Section ITI: 
YW =L=1,+1L,=e"%+e” 
Z = In(e™ + e%), 


with the Y; Gaussian random variables. Let 


(39) 


(40) 


(41) 


(42) 


(43) 
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w= Yo-— Y; 
Mw = W = E(w) = my, — my, 


ow = E(w — w)’ = a5, + oy. (44) 


For convenience, we shall let f(w) denote the particular Gaussian 
density function 


f(w) = ®'(w; Mw, ow); (45) 
Z is rewritten as 
Z = In(e™ + e*) = Infe*(1 + e”)]. (46) 
Taking the required expectations 
E{Z} =m. = my, + E In(1 + e”) 


= My, + In(1 + e”)f(w)dw. (47) 


As discussed earlier, the integral is broken up into the intervals 
(—oo, 0) and (0, ©). Upon application of the series (36) and the 
averaging (42) and (43), we obtain the result 


Mw Ow Ke 
Mz = My, + m.o( 2) o— en mi! 20% 
0; 





w V2Q7 
co ms a 2 
+F Cie etme (Me Hee) (48) 
k=1 w 


+ grime ( Me Bre — “) i 
Ow 


All the terms, except m,,, are lumped together and denoted by 
Gi(ow, Mw): 


Mz = My, + Gil(dw, Mw). (49) 


The function G, is shown graphically in Fig. 5. 
The second moment is decomposed as 


E(Z’) = E{{Yi + In(1 + e”)}} 
= oy, + m5, + E[2Y, In(1 + e”)] + Efin’(1 + e”)]. (50) 
First, consider the cross term 
A= E{[2Y, In(1 + e”)], (51) 


where Y; and w = Y2 — Y; are jointly Gaussian with correlation 
coefficient 
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Fig. 5—The plots of Gs as a function of o,, with m, as a parameter. 


= |S = My,) (Ww = me | 


p — 
Oy, Ow 
oe 
Ow 
= —oy,/Vo5, + 05, : (52) 


We utilize the conditional expectation property of jointly Gaussian 
random variables (see Ref. 17, Section 7-5): 


E[(¥1 — my,)|(w — m,)] = = (w — mu) (53) 
or 
E[(Yi|w)] = my, — p?(w — mw). 
Then, (51) becomes 
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A = E[2Y; In(1 + e”)] 
= 2E,[In(1 + e”)Ey,|w(¥i)] 
= 2E,,{In(1 + e”)[m,, — p?(w — m.)]}. 


(54) 


For convenience, we denote the first and second terms by A; and Ag, 


respectively: 
A=A,— A2. 
Since 
A, = 2m,,E[In(1 + e”)] 
= 2my,(mz — my,) 
from (47), we need only consider the second term Ag, 
Az = 2p°E[(w — m,) In(1 + e”)]. 
Integrate A2 by parts: 


Ao = 20° (w — mw) In(1 + e”){(w)dw 


+0 ev 
= 20a: —— f(w)dw. 
P Ow [ ite” f(w) 
For negative w, we use the expansion 


Ww 


e 
1+ e” 





a (—1)*e**)” 
2, 


to obtain 
0 Ww 


e 
As = 29°02 
a [= 





f(w)dw 


= 2po2, y (—1)* emule) + (e+1)%01/2 
k=0 
—Mw — Ow k+1 
Ow 


For the other part of (58), we have 


“  e 
As’ = 20702 
. [ 1+ e” 


w 





f(w)dw 


—w 
‘ l+e 


= 2p’02 | ane ny 


POWER SUMS 


(55) 


(56) 


(57) 


(59) 


(60) 
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= 2962 |S (-1)te™(flw)aw 
k=0 


0 


co 
= 20702, »y (—1)*e-hmwth’oi/2 
k=0 


_ 72k 
o| = | 61) 


Ow 
Collecting the results for the cross term, 
A =A, — A2= A; — (A2 + Az’) 
= A; — 2p°G3(ow, Mw) 
= 2my,Gi(ow, Mw) — 2p7G3(ow, Mw). (62) 


The remaining term in E(Z’) to evaluate is the last expression in (50). 
We denote it by Go(ow, mw). 


G2(ow, Mw) = E[ln?(1 + e”)] 


F : 
= | In*(1 + e”)f(w)dw + i In*(1 + e”)f(w)dw 
M ie 


= B, + Bo. (63) 


For the integration over the negative real line, we can use the expansion 
given by (37): 


_ 0 
B= ¥ | ee)" Fw) dw 
k=l i 


foo] 
_ Y Dzelle+Vmw+ (k+1)*0%/2] 
k=1 


yf : (64) 


Ow 


The expression By: is rewritten as: 


Bz = | {Infe”(1 + e~”)]}*f(w)dw 
0 


= i [In*(1 + e~”) + 2w In(1 + e”) + w*]f(w)dw 
0 


= B,;+ Bs + Bs. (65) 


The expressions for B3 and B; are relatively straightforward: 
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B= | w’f(w)dw 
0 


se tea cz) 


+ V2/m7 myoper™/2% 








—MNw —MNwIw ; j 
+oe}1—- o( ) —_———_ eM / 264. (66) 
Ow V2a 


B; = | In?(1 + e”)f(w)dw 
0 


0 
-| In?(1 + e”)f(—w)dw 


0 


bpe**”f(—w) dw 
1 


I 
aor 


7M 8 


_ y bpeW Ft Vmwt (k+1)’ou/2 
k=1 
wi Z k + l 
a 67) 


The final term to consider is 


B= | 2w In(1 + e”)f(w)dw 
0 


foe] 


0 
=-25 a. | we’ f(—w) dw. (68) 


k=] 


Complete the square for the expression e*”’f(—w) and integrate to 
obtain 


fe ree —mMp Ow 2 9 _2 
Be=-2 Oyermatren] mie =) a ein | (69) 
Jn 


k=l Ow 20 
where 
mr = —My + koi. (70) 
Collecting the terms that make up G2(o,, m.) in (63), we have 
G2(o., Mw) = B, + Bo = B, + (B3 + By + Bs) (71) 


as given by eqs. (64), (66), (69), and (67). 
To summarize, the second moment is given by the expression 
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E(Z’) = o5, + m5, + 2 my,Gi(ow, Mw) 
— 20°G3(ow, Mw) + Ge(ow, Mw), (72) 
and, finally, the variance is 
o2 = E(Z’) — mz 
= 03, — Gi(dw, Mw) — 2p°Gs(ow, Mw) + Ge(ow, Mw). (78) 
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To approximate functions of a single variable by using linear 
interpolation is routine in empirical studies. Here, we consider ap- 
proximating functions of several variables in a similar piecewise 
linear manner. We focus on the nontrivial part of this technique, 
which is that of choosing the appropriate “pieces” for the piecewise 
linear approximation. Precisely, we seek to identify the best interpo- 
lants to use at a point of interpolation. This is not an issue for 
functions of a single variable, since the linear ordering of the number 
line leaves us with no choice. For functions of several variables, we 
propose several simple tools to help uncover undesirable choices. The 
techniques presented are useful in the empirical study of quantita- 
tively complex functional relationships whose qualitative behavior is 
nevertheless known and simple. Response time relationships para- 
metrized by workloads in computer performance modeling often fall 
into this category, and an actual bivariate function of this type is 
used to motivate the development. 


l. INTRODUCTION 


Linear approximation is a popular and economical way to gain 
appreciation of the behavior of functional relationships. Especially in 
higher dimensions, making sense out of a sample of data points in 
terms of the underlying multivariate relationship is greatly facilitated 
by some form of piecewise linear approximation. Such an approxima- 
tion allows for estimation of the function at values not included in the 
sample, sheds light on the activity of the function at selected neigh- 
borhoods, and identifies regions where the relationship behaves inter- 
estingly. 

For the sake of concreteness let us consider a typical problem. A 
simulation model of a computer system has been given. The workload 
driving the system is described by two variables giving the respective 
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percentages of two classes of users in the user population; there are 
three classes of users altogether. The two variables, then, form a 
bivariate parametrization of the workload. Six simulations were run, 
and the resulting mean response times are shown below. (This response 
time function will be denoted by A throughout this paper.) 


h(60, 25) = 5.5 
h(60, 7) = 2.1 
h(40, 30) = 1.2 
h(40, 7) = 1.0 
h(20, 25) = 0.7 
h(20, 15) = 0.7 


It will be necessary to estimate the response times for many more 
workloads than for the six simulations already run. Yet, it would be 
inefficient to make a run for each possible parameter pair (x, x2). It 
would also not be necessary, given the qualitatively simple relationship 
that generally exists between workloads and response times of com- 
puter systems. In this particular case, the function is expected to be 
monotone. Therefore, the six simulations not only give us the values 
of the function at those six points, but also the values between and 
around them, that is, at least approximately. For example, we could 
safely assert that (55, 10) should lie between 1.0 and 5.5. In fact, it 
would be reasonable to estimate the range to be from 1.8 to 5.5. This 
becomes obvious by plotting the six-parameter pairs used in the 
simulations, plus the point P = (55, 10) on the (xi, x2) plane, as shown 
in Fig. 1. The points A through F are labeled in the order in which 
they were collected; point A is the oldest. Since the polygon with 
corners A, D, B, and E bound the point (55, 10), it is reasonable to 
deduce that the values of h at A, D, B, and E bound the value of h at 
(55, 10). This gives the interval (1.0, 5.5). Furthermore, A(55, 7) may 
be estimated as 1.8 by linear interpolation between h(40, 7) = 1.0 and 
h(60, 7) = 2.1. Since h is increasing in both x; and x2, h(55, 10) 2 
h(55, 7). This gives the sharper interval (1.8, 5.5). 

As we will see, this approach can be continued until a numerical 
estimate for h(55, 10) is reached. Two steps are required. The first 
step identifies the data points that contain relevant information about 
h(55, 10). For example, we have already rejected C and F as irrelevant 
to P. This was simple and was done “by eye.” Finer methods need be 
developed, however, to determine which one of A, B, D or E should be 
further discarded. The major part of this paper, starting with Section 
III, deals with this and related problems. The second step consists of 
fitting a linear function over the data points chosen as a result of the 
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B = (40, 30) 
x9 


pea oe) D = (60, 25) 


C = (20, 15) 


E = (40, 7) A = (60, 7) 





xy 


Fig. 1—Data points for case study. 


first step; the mechanics of linear fitting will be reviewed in Section II. 
Together, the two steps produce the “best” way to fit the data points 
in a piecewise linear way. The concluding recapitulation gives an 
overview of the methodology in an iterative context. 

Formulas used in our treatment are gathered in a sequence of 
propositions. Since they are rather straightforward and our interest is 
in their usage, only abbreviated arguments for their validity are 
included. All calculations may be easily carried out on a computer. 
Needed, aside from basic arithmetic, are routines to define and manip- 
ulate matrices, perform matrix multiplications and inversions, take 
determinants, and find eigenvectors of symmetric matrices. Compu- 
tations for our examples have been carried out with relatively short 
programs in the S statistical package.’ 

Our proposed approach is not meant to replace the standard practice 
of fitting a single, global functional relationship to the data. The two 
procedures reveal different aspects of the data. Function fitting, being 
inherently global, is well suited for capturing the overall behavior of 
the relationship. This we hope to complement with the piecewise 
linear approach, which, being inherently local, is better suited for 
pinpointing places where interesting things happen to the function. 
This is especially relevant for iterative empirical studies, where it is 
useful to know where the function is active in the parameter space so 
that further experiments can be fruitfully specified. 


ll. THE BASIC INTERPOLATION 


Linear interpolation for functions of a single variable is usually 
taught in high school in terms of such notions as similar triangles and 
slopes. For the purpose of generalization to higher dimensions, an 
alternative, though algebraically equivalent, viewpoint is preferred. 
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Specifically, let f(x1) = 1, f(xe) = ye, and x1.< x < Xe. It can be shown 
that the usual linear interpolation gives the estimate 


Qiyi + a2y2, 
where a; and ae satisfy 
A,X, + O2X2 = X, 
a +a2=1, 
01, a2>0. 


In other words, to estimate f(x) for some x in the interval x; < xo, 
express x as a convex combination (weighted average) of x; and xe, 
then estimate f(x) as the same convex combination of f(x:) and f(x2). 
Notations: As usual, R” is the set of n-dimensional real column vectors. 
Also standard is the use of the prime notation for matrix transposition, 
as in M’. If the jth column of matrix M is m,;, we write M = 
(mi, ---, m,). The length of a vector v = (ui, -++ , Un)’ is denoted by 


lvl] = V(vi +--+ +07). 


For any x € R’”, let x* € R”*’ denote 


(i) 


i.e., the vector consisting of x followed by the singleton 1. If fis a 
function, f denotes its approximation by linear interpolation. 

A higher dimensional analogue of having the points x1, x2 generate 
an interval in one dimension is having points x1, X2, x3 generate a 
triangle in two dimensions and having x1, X2, X3, Xs generate a pyramid 
in three dimensions. Just as we require that an interval in one dimen- 
sion should not shrink to a point, we require that a triangle 
should contain area on a plane rather than reduce to a line segment, 
and that a pyramid should contain volume in solid space rather than 
collapse onto some plane. In general, we are concerned with the convex 
hull of (nm + 1) points xi, +++ , Xn+1 © R”, subject to the condition that 
these points do not le in some lower (<n) dimensional hyperplane. 
The matrix formulation of this geometric requirement is that the linear 
transformation X given by 


XK = (Ki — Xn41, ¢°*, Xn — Xn41) 


should have the trivial kernel, or, by the rank-nullity theorem, satisfy 
det(X) ¥ 0. 


Definition: An n-dimensional pyramid is the convex hull of a set of 
(nm + 1) vectors {X1, +++, Xn+1} C R” such that 


det(K:1 — Xn+1, °+*, Xn — Xnti) FO. 
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The points x; are called its vertices, and any subset of n vertices forms 
a face of the pyramid. 

We will often call a set of points a pyramid, rather than a convex 
hull generated by these points, as would be strictly correct; however, 
this makes no difference since there is a one-to-one correspondence 
between the set of vertices and the pyramid it generates. 

Proposition 1: Let f: R” — R be a function of n variables. Let 
{Xi, +++, Xn+i1} C R” be (ie, generate) an n-dimensional pyramid 
containing x € R”, and let f(x;) = y;. Then, 

(i) X* = (xf, ---, xnei) is an invertible (n + 1) X (n + 1) matrix, 
and 

(tt) the estimate f(x) = (yi, +++, Ynt1)(X*) 'x* is a higher dimen- 
sional generalization of linear interpolation for a single variable. 
Proof: 

(i) If X* were not invertible, its columns would be linearly depend- 
ent. Therefore, one of the vertices would be a convex (not necessarily 
positive) combination of the n remaining vertices, implying that the n 
+ 1 points lie in (7 — 1)-dimensional hyperplane at most. This contra- 
dicts the definition of a pyramid. 

(it) Let a = (X*)7'x* = (a1, +++, Qn+i)’. Since x* = X*a, it follows 
from the star (*) definition that 


XK = aX) + ee) + Qn41Xn+1, 
Qt ee) + Qn41 = 1. 


Thus, a expresses x as a convex combination of {x1, --- , Xn+i1}. Since 
x is contained in the convex hull, a = 0. Hence, the linear interpolation 
at x should be 


f(x) = a y1 +ieee + Qn4+1yn+1, 


which is as proposed. 

In practice, we will not know whether x is contained in the convex 
hull. Therefore, the vector of coefficients a should be explicitly com- 
puted to check that a 2 0. The use of the procedure with negative 
components in a corresponds to linear extrapolation. 


Example: Returning to the computer system response time function 
h, we see from the (x1, x2) plot in the previous section that 


w.2.a)-{(2). (*). (%)} 


forms a two-dimensional pyramid, or triangle, containing 


p= (8). 
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Alternatively, if plots are not feasible, as would be the case in higher 
dimensions, we calculate 


at hee 40-60 


oe 7) = 460 #0, 


checking that a noncollapsing pyramid is obtained, and 


| 40 40 60\~ /55 0.13 
a={30 7 7 10 |} ={ 0.12 }=0, 
111 1 0.75 
checking that 
55 
10 


is contained in the pyramid. Interpolation according to the formula 


gives 
0.13 
(1.2 1.0 2.1) | 0.12 | = 1.85. 


0.75 


.a.r)={(*), (#) (2)] 


is a pyramid containing 
55 
10)’ 


and interpolation using this pyramid gives 


40 60 60\~' /55 
(10 21 55){ 7 7 25 10 } = 2.39. 
Lt! ai 4 1 


Similarly, 


lll. THE SELECTION PROBLEM 


We have just seen how two different choices of pyramids can lead to 
two markedly different approximations. Taking the average and ap- 


proximating 
- (55 
(i) 


by %4(1.85 + 2.39) = 2.12 is not justified. An average is appropriate 
when summarizing a batch of numbers that have been made different 
by random error, since the averaging process “zeroes out” the random 
errors. In our case, the difference between the two approximations 
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does not arise from random error. One of the choices is better than the 
other, and that is not a probabilistic phenomenon. 

Another way to proceed is to put greater trust on closer interpolants 
than on further ones and choose the pyramid yielding the least- 
summed distance from its vertices to that point. Under this criterion, 
the pyramid ADE should be preferred over ABE, since D is closer to 
P than B is by nine units. Unfortunately, the notion of distance is not 
invariant to the choice of scales on the axes, and the criterion becomes 
sensitive to changes of units in the independent variables. The dis- 
tance-type criteria have a further, more fundamental weakness, which 
they share with, among others, the criterion of Lawson.” 

As used by Akima? in his algorithm for bivariate smooth interpola- 
tion, Lawson’s criterion triangulates the (xi, x2) plane in such a way 
that minimum interior angles of triangles are maximized. The objective 
is to set up as many “fat” triangles as possible. Under such “fatness- 
type” criterion, ADE is again chosen over ABE, since ABE comes with 
the companion triangle ABD, which is too “skinny.” An extremely 
skinny triangle is undesirable for two reasons. First, its X* may be 
numerically unstable to invert. Second, it biases the estimation along 
a particular direction in the (x), x2)-plane. (On the other hand, we will 
see in Section VI that thoughtful biasing may be beneficial.) It is not 
necessary, however, to make fatness an overall criterion just to avoid 
such excesses. 

The problem with these distance- and fatness-type criteria is that 
only the configuration of the x variables is used in assessing pyramids. 
(With the curve-fitting approach that uses Lagrangian and trigono- 
metric polynomials,* even this information is not taken into account.) 
This is an unnecessary restriction. Surely the values of the function at 
the interpolants, the f(x)’s, have much to say on the adequacy of an 
interpolation. Our basic working tool to tie x together with f(x) is the 
notion of steepest ascent. Using it, we show that the correct pyramid 
to choose is ABE, rather than the pyramid ADE, favored by both 
criteria above. Incidentally, the ABE choice for (55, 10) was confirmed 
empirically by an actual simulation run which gave the result 
h(55, 10) = 1.74. Recall that interpolation with ABE gives 1.85, 
representing a 6.3 percent error, while interpolation with ADE gives 
2.39, representing a 37.4 percent error. The fact that the interpolation 
overestimates could also have been predicted. See Section VU, where 
assumptions underlying our approach are discussed. 


IV. STEEPEST ASCENT 


Equipped with the mechanics of basic interpolation, we may analyze 
in detail the function’s interpolated behavior on a given pyramid. We 
would like to know along which direction inside the pyramid the 
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function increases, and how fast that increase is. Because of the 
linearity of the interpolation, both questions have well-defined an- 
swers. The geometric way to derive an answer is to slice the pyramid 
into parallel “contour hyperplanes,” which are systems of hyperplanes 
such that points on the same hyperplane have the same interpolated 
values. 


Example: Figure 2 shows some contour hyperplanes (contour lines in 
two dimensions) corresponding to the pyramid (triangle) ABE. Points 
along the downward-sloping lines take the same interpolated values as 
labeled. Clearly, A increases most rapidly along the direction of the 
arrow, which is perpendicular to the contour lines. 

Since the contour hyperplanes are parallel, a single vector perpen- 
dicular to all of them can be found. Actually, there will be many such 
vectors, but they can only have one of two opposite directions, one for 
increasing f and another for decreasing f. We agree to take the 
increasing direction. Being perpendicular to the contour hyperplanes, 
this direction has no component along which ; does not increase. 
Therefore, it is the direction along which f increases the fastest. We 
define a unit vector in this direction to be the direction of steepest 
ascent of the function fin the pyramid. The rate of steepest ascent is 
the increase of f along the direction of steepest ascent per unit distance 
traveled. 

It remains to formulate these geometric notions in matrix terms. 


Proposition 2: Let f: R” — R be a function of n variables, {xi, ---, 
Xn+1} C R” be a pyramid, f(x;) = yi, and X* = (xt, «++, Xn+1). Let 
(ai, Se Rery An+1) = (y1, ria | Yn+1) (X*)7. 


(t) The direction of steepest ascent in {x1, «++, Xn+i} IS (@1, ++-, 
an)/\|(ai, +++, @n)||, and 
(ii). The rate of steepest ascent in {xi, +++ , Xn+1} is ||(ai, «++, an). 





Fig. 2—Contour lines for ABE. Function increases in direction of arrow. 
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Proof: By definition of (a1, «++, @n+i), the interpolation fits the 
function 


F(xa, +++, Xn) = AX, + ee) + AnXn + Ans 


on the pyramid (x, +--+, Xn+}. 

To show (z), note that f(u) = flv) implies that (a, ---, an) (u — v) 
= 0. To show (i), note that f((a1, ---, an)/|\(ai, «++, an)||) —f (0) = 
(ai, +++, @n)Il. 

Example: Table I lists some directions and rates of steepest ascent 
that we will use later. 


V. A FIRST APPROACH: JUDGING BY FACES 


Example: We are now ready to reject ADE in favor of ABE. Actually, 
since ADE and BDE form a companion pair, we will reject them both. 
This will be accomplished by analyzing the face DE. In Fig. 3, ADE 
and BDE are shown along with their respective directions of steepest 
ascent loosely placed about their centers. The two directions of steep- 
est ascent make sense separately. Both show response time as an 
increasing function of x; and x2. Together, however, they show that 
DE is a ridge, i.e., A increases as DE is approached from either side. 


Table |!—Some directions and rates 
of steepest ascent 


Rate of 
Direction of Steepest 
Pyramid Steepest Ascent Ascent 
ADE (0.29, 0.96)’ 0.197 
BDE (0.99, 0.05)’ 0.218 
ABD (0.81, 0.58)’ 0.323 
ABE (0.98, 0.19)’ 0.059 
B 
x9 
D 
E A 


Fig. 3—ADE and BDE with their directions of steepest ascent. 
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Alternatively, a ridge indicates that A is not monotone along some 
direction on the (x1, x2) plane. From our knowledge of h, no such ridge 
should exist. The triangulation {ADE, BDE} incorrectly shows a ridge 
because A(D) is inordinately large. The triangulation allows D to pull 
up both triangles, causing their boundary to buckle. The correct way 
to picture A is shown in Fig. 4. For low values of x; and x2, A grows 
moderately (rate of steepest ascent = 0.059 on ABE), with x, being 
mainly responsible for the increase. As x: and x2 become large, 
h explodes (rate of steepest ascent = 0.323 on ABD), and x2 begins to 
affect A seriously, although x is still the major contributor. Because 
of the piecewise linearity of interpolation, AB has become an acceler- 
ating, refracting boundary but not a ridge. In Section VI, we will see 
another reason why ADE is a poor choice. 

Note that specific knowledge about A was invoked to make the 
selection: We knew that response time as a function of workload had 
no ridges. In general, linear interpolation of functions with known 
major ridges, valleys, maxima, or minima should be avoided. An 
exception may be made if we are able to sprinkle such tricky terrain 
generously with further interpolants. Under finer resolution, the vol- 
atile formations should diminish. Of course, even if a ridge does exist, 
it is highly improbable that it should coincide with one of our faces. 
Definition: A face is a bad face if there are two pyramids containing it, 
and their directions of steepest ascent either both point towards it or 
both point away from it. 

Thus, we accept as likely those faces that may refract and/or 
accelerate steepest ascent vectors, but we question faces that gather 
or scatter them. 

Matrix techniques are clearly needed to identify bad faces in higher 
dimensions, where graphs are not feasible. It is enough to find a 
technique to determine if a given steepest ascent approaches a given 


B 





Fig. 4—ABE and ABD with their directions of steepest ascent. 
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face. Since approaching a face at 90 degrees is different from approach- 
ing it at 0.0001 degree, the angle of approach is also of interest. Note 
that knowledge of the angle alone does not tell us whether the ascent 
approaches; the ascent may be located on either side of the face. 
Proposition 3: Let a € R” be a steepest ascent for the pyramid 
{X1, -+*, Xn4r} C R”. Let {xi, ---, x,} = F be the face of interest. 
Define the n X (n — 1) matrix 


X = (Ki — Xn, X2 — Xn, *°*, Xn-1 — Xn). 
Also define the n X n matrix 
Y = (—X, a) = (Kn — Xi, Xn — Xo, ¢¢*, Xn — Xn-1, A). 


(i) Let B= (Bi, «++, Bn)’ = Y | (Xn — Xn41). If Bn > 0, a approaches 

F. If 8, < 0, a recedes from F. If Y is not invertible, a is parallel to F. 
(ii) Let cos 6 = Va’ X(X’X)"'X’a. Then @ is the angle between a 

and F. 

Proof: 

(t) The trick here is to pull the vertex x,+4, to the origin so that we 
may examine the relationship between a, now sitting at the origin, and 
F, now translated to (Xi — Xn4i, +++, Xn — Xn+1}. For B as given above, 
let 7; = B: fori =1,---,n—-—1, am =1— (m+ +++ + m-1), andt = 
Bn. The reader may check that 


n 


ta = ¥ mi (Xi — Xnt), 


i=l 

n 
m=. 
i=l 


The interpretation of the sign of 8, follows from the trick just de- 
scribed. 

(iz) The trick is to translate the face to the origin by x,. We then 
regress a against the translated face {x) — Xn, +++, Xn-1 — X,} so that 
its projection proj(a) onto the linear subspace spanned by the trans- 
lated face may be found by the usual formula. Now take the inner 
product between a and proj(a), bearing in mind that a is a unit vector. 
Example: Using the formulas above, we find the DE is approached by 
the direction of steepest ascent of BDE at 31 degrees, and DE is also 
approached by the direction of steepest ascent of ADE at 39 degrees. 
Thus, DE is a ridge. 

VI. ANOTHER APPROACH: THE AXES CRITERION 


Example: Let’s consider again the selection problem with the response 
time function y = h(x1, x2), but this time we consider the complemen- 
tary pair of triangles ABD and BDE, shown in Fig. 5. While neither 
triangle is very skinny, we could roughly identify directions along 
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which they stretch out. For example, ADB stretches out in some 
direction along AB, which is quite perpendicular to its direction of 
steepest ascent. BDE, on the other hand, stretches somewhat along 
DE, which is not at all perpendicular to its direction of steepest ascent. 
This difference affords another basis for selection. Let’s exaggerate 
somewhat and give the two triangles more definitive and mutually 
perpendicular directions of stretch; we also assume that the “direction 
of steepest ascent of the function” (a vague notion), denoted by a, is 
perpendicular to the stretch of ABD. The situation is shown in Fig. 6. 
We may now rotate the configuration to new axes (x1, x) so that the 
pair of triangles stretch in directions parallel to the new axes, as in Fig. 
7. Because of our assumption, a becomes parallel to x}. This means 
that x has no effect on y, so we may plot y as a function of x} alone. 


x9 


a, = ASCENT IN ABD 
a, = ASCENT IN BDE 


Fig. 5—Triangles ABD and BDE with directions of steepest ascent. 





xy 


Fig. 6—ABD and BDE stretched in mutually perpendicular directions. 
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Since a points along x}, y is an increasing function of x}. If y is a linear 
function of x{, it will also be a linear function of (x1, x2), in which case 
it doesn’t matter which triangle we select. Both will give perfect fit. 
However, for h, we must protect ourselves against nonlinearity. We 
actually know that h is concave downwards as a function of x}. (If h 
is concave upwards, the same argument below holds.) The elimination 
of x2 leads to the (xi, y) plot given in Fig. 8. Clearly, ABD hugs the 
function much better than BDE. By stretching linearly far along the 
direction of ascent, BDE loses touch of the underlying, nonlinear 
function. Here is the second reason why ABE is a better choice than 
ADE. 

The moral is that triangles perpendicular to their directions of 
steepest ascent give better estimates. The same is true in higher 
dimensions, although in higher dimensions it is not appropriate to talk 
about the direction of stretch of a pyramid because a pyramid may 
stretch in several directions at once. In three dimensions, for example, 





Fig. 7—Rotation of Fig. 6. 





Fig. 8—Elimination of x2. 
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a solid pyramid may stretch out in one single direction, as in a needle- 
like pyramid, or in two directions, as in a pancake-like pyramid. 
Therefore, we will appeal to the notion of the axes of a pyramid—the 
system of coordinates that best aligns with the vertices of the pyramid, 
such that the first coordinate aligns with the furthest stretch of the 
pyramid, ---, the last coordinate aligns with the shortest stretch of 
the pyramid. The coordinates (x1, x2) found above by rotation are an 
informal, visual example. Formally, it is necessary to define “best 
alignment.” For convenience, we will define it in terms of least-squared 
distances. Then, we may use the principal component technique from 
multivariate statistical analysis.° 


Definition: Let {x1, +++, Xn41} C R” be a pyramid with mean vector 





x= (x, + e++ + Xp41). 


n+1 
Define the n X (n + 1) matrices 


X = (Xi, +++, Xn), X = (X, ---, X). 


Define the n X n covariance matrix 
1 = = 
Va=~ (KX — X)(X— XY’ 


Let vi, «++, Vn be the eigenvectors of V with eigenvalues A; 2 --- 2 
An. (That is, Vv; = A;v;:.) Then v; is the first axis of the pyramid, v2 is 
the second axis of the pyramid, --- , Vn is the last axis. 

The eigenvalue 4; tells us how far the pyramid stretches along v;. 
Thus, a four-dimensional pyramid with eigenvalues 1000 = 102102 
10 stretches out mainly along vi, while a four-dimensional pyramid 
with eigenvalues 1000 2 500 2 10 2 10 stretches mainly along v; and 
V2, with the stretch along vi being significantly larger than the stretch 
along v2. A pyramid with identical eigenvalues does not stretch in any 
direction and is “fat.” The eigenvalues cannot be 0 for pyramids. 

The use of the axes v), ---, V, in evaluating a pyramid is as follows: 
if a is its direction of steepest ascent, then the ideal pyramid 
{X1, +++, Xn+1} satisfies 

a’v) =0 


a’Vn-1 = 0. 
Thus, its first (n — 1) axes are perpendicular to the direction of steepest 
ascent, leaving the last axis, the direction of shortest stretch of the 


pyramid, to coincide with its direction of steepest ascent. When com- 
paring less-than-ideal pyramids, we choose the one where | a’v;| is close 
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to 0 for as many of the first vi, --- , Vm as possible. Note that a’v; is 
the cosine of the angle between a and vj. 

Example: Using the formulas given above, Table II has been com- 
pleted. Note that vi and v2 are mutually perpendicular, as they must 
be. From the relatively high ratios of A;/A2, it follows that all four 
triangles have fairly prominent major (first) axes, i.e., they stretch 
along some distinct direction. ABD and ABE form the narrower pair 
of triangles. By comparing |a’v;|, it follows that ADE and BDE tend 
to stretch along their respective directions of steepest ascent, while 
ABD and ABE are more perpendicular to their respective steepest 
ascents. Therefore, we chose the triangulation {ABD, ABE} over the 
triangulation {ADE, BDE}. 


Vil. ASSUMPTIONS 


Now we spell out some assumptions about the underlying function 
that are relevant to the validity of our linear interpolation. 

The foremost assumption is that the function is reasonably smooth 
with respect to the mesh of the interpolants. Without such basic 
optimism, which is buttressed by the prerogative of adding further 
interpolants to refine the mesh, systematic investigation could not 
proceed. As described in Ref. 6: “The experimenter is like a person 
attempting to map the depth of the sea by making soundings at a 
limited number of places. --- mapping a surface resembling a nest of 
stalagmites or the back of a porcupine would be impossible --- since 
characteristics of the surface at one point would not be related to 
characteristics elsewhere.” Under the smoothness assumption, the axis 
analysis presented in Section VI is valid, since it simply recommends, 
in quantitative terms, the prudence of spacing out the interpolants 
only where the function appears to behave uneventfully. 

The face criterion of Section V applies when the function is mono- 
tone, in addition to being smooth. (Monotonicity as defined here is 
more stringent than what is usually required.) 

Definition: A multivariate function f: R” — R is monotone if one of 
the following holds for any x, y € R”: 

(i) For all a, B € (0, 1), a> B > flax + (1 — a)y) 2 f(bx + 
(1 — B)y), or 


Table II—Axes analysis for typical problem 


Trian- Steepest As- Kigenvalues 

gle cent a Eigenvectors uv, v2 Aa, A2 a’, 
ADE (0.29, 0.96)’ (—0.78, —0.63)’, ( 0.63, —0.78)’ 182.0, 59.3 —0.83 
BDE (0.99, 0.05)’ (—0.65, —0.76)’, ( 0.76, —0.65)’ 183.7, 96.0 —0.68 
ABD (0.81, 0.58)’ ( 0.68, —0.73)’, ( 0.73, 0.68)’ 233.4, 46.3 0.13 
ABE (0.98, 0.19)’ (—0.60, 0.80)’, (—0.80, —0.60)’ 234.5, 75.2 —0.44 
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(ii) For all a, B € (0, 1), a > B > flax + (1 — a)y) S f(Bx + 
(1 — B)y). 

That is, f either increases or decreases consistently along whichever 
direction one travels in the x-space. 

Often, specific knowledge of the function includes its monotonicity. 
With our A, for example, it is unlikely that any direction should be 
singled out along which h “rollercoasts.” 

A further useful assumption is convexity (concavity). 

Definition: A function f: R” > R is convex on a convex region A C R” 
if for any x, yE A and aE (0, 1), 


f(ax + (1 — a)y) = af(x) + (1 — a) f(y). 
The function fis concave if 
flax + (1 — a)y) Zaf(x) + (1— @) f(y). 
If fis convex on the region A, the interpolated function/ satisfies 
f(x) S f(x) (KE A). 


(Similarly, if f is concave, we have f = f) Hence, knowledge of the 
convexity (concavity) of f permits the conclusion that the interpolated 
approximations are the upper (lower) bounds to the true values of f. 
Since / appears to be convex on the hexagon ABCDEF,, the response 
time estimates obtained from linear interpolation are likely to be 
worst-case estimates. This agrees with actual data: 


h(55, 10) = 1.74 S 1.85 = A(55, 10). 


Proposition 4: Let ||a|| be the rate of steepest ascent in pyramid P = 
{X1, +++, Xn+i} C R”. Let rn, ---, 7; be the rates of steepest ascent in 
all pyramids that share some face F with P such that a recedes from 
F. Similarly, let s1, ---, s; be the rates of steepest ascent in all 
pyramids sharing some face with P that is approached by a. Then, for 
monotone f: 

(t) It is consistent with the convexity of f on P that 


r,s, nSiall ssi, ---, s;. 
(ii) It is consistent with the concavity of fon P that 
r++, 72 |lal] 2 si, ---, 5). 
Example: It follows from Fig. 9 that A is likely to be convex on ABE, 
since 0.025 < 0.059 < 0.323. 


Vill. MORE ON THE SECOND APPROACH: A GLOBAL METRIC 


Selecting the “best” system of pyramids was straightforward in our 
example with ABE and ADE, since the number of triangles was few 
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Fig. 9—Convexity of h. 


and there was no trade-off between the two systems. When comparing 
systems of pyramids involving possibly dozens of pyramids each, and 
when each system contains both “good” and “bad” pyramids, individ- 
ual assessments of pyramids must be combined into an overall measure 
to facilitate the comparison. A combined, global measure of the opti- 
mality of a system of pyramids is also necessary for the application of 
our procedure on a computer. We now derive such a metric under the 
principles discussed in Section VI. 

Definition: Let P = {xi, +++, Xn+1} C R” be a pyramid with associated 
steepest ascent vector a. Let 4; 2 --- 2 A, > 0 be the eigenvalues 
derived from P as in Section VI, and let v;, --- , v, be the correspond- 
ing eigenvectors. Define 


Ms 


» Aile’vi| 
m(P) a =. 
di 


i] 


Ms 


1 


~. 
i} 


In the above expression, m(P) is intended to measure the departure of 
P from optimality. From the weighted-average form of its definition, 
it follows that a large value of m(P) is caused by large values of | a’v;| 
for the larger values of 4;. Equivalently, it implies alignment of the 
major axes of P along a. An optimal P, of course, will do the opposite: 
As seen in Section VI, it will align its least-significant axes along a. 
Definition: Let {Pi, ---, Pn} be a system of pyramids. Let Vol(P;) 
denote the volume of P;. [Recall that Vol({x1, +--+, Xn+i}) = 
det({xi, «++, Xn+1})/n!; see (Ref. 7, p. 331).] Define 


M({Pi, +--+, Pm}) = y Vol(P;)m (Pj). 
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The expression M({Pi, --- , Pm}) is the proposed overall metric of the 
departure of the system {Pi, ---, P,} from optimality. It sums the 
nonoptimality of each of its constituent pyramids, giving greater weight 
to the more voluminous pyramids. Note that it is not necessary to 
normalize M by the sum 


> Vol(P;), 

J=1 
since two comparable systems of pyramids will cover the same total 
volume. 


Example: Applying the definitions above, 
m(ADE) = (182.0*0.83 + 59.3*0.56)/(182.0 + 0.56) = 0.765, 


60 60 40 
Vol(ADE) = det} 7 25 7{| = 180. 
1 1 i 


m(BDE) = 0.695, — Vol(BDE) = 230. 
M({ADE, BDE}) = 297.42; 

m(ABD) = 0.269,  Vol(ABD) = 180. 

m(ABE) = 0.492,  Vol(ABE) = 230. 
M({ABD, ABE}) = 161.85. 


Since M({ABD, ABE}) < M({ADE, BDE}), our calculation with the 
metric M agrees with our earlier conclusion that {ABD, ABE} is the 
preferred triangulation over {ADE, BDE}. 


IX. BUILDING PYRAMIDS 


One practical issue remains. In actual empirical studies, vertices 
become available one by one, and except when the first pyramid is 
formed, the pyramid building process is always applied to an existing 
system of pyramids. Especially in higher dimensions, we need an 
efficient and consistent method to incorporate new vertices into old 
systems of pyramids. 

Example: Suppose the vertices A, B, C, Din our computer performance 
example have been satisfactorily triangulated, and EF appears as a new 
vertex, as shown in Fig. 10. What new triangulations are generated so 
that they may be compared? If the new vertex is inside some triangle 
[e.g., H = (55, 10) inside ABE], then an obvious reasonable answer 
exists (e.g., replace ABE by HBE, AHE, and ABH). In higher dimen- 
sions, the procedure is just as simple, so hereafter we will only consider 
new vertices not in the convex hull of the old vertices. We should 
specify that we start out with a minimal cover, that is, we start out 
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Xo 


x4 


Fig. 10—Adding a new vertex to a system of pyramids. 


from a system of pyramids containing the convex hull of the old points 
(“cover’’) in such a way that any point is in the interior of at most one 
pyramid (“minimal”). For example, if triangle BCD were added to the 
triangulation above, we would no longer have a minimal cover, since 
some points close to B would be inside two triangles. We would also 
like to end up with a minimal cover after incorporating the new point 
E. By going from minimal cover to minimal cover, the method we 
develop could be used again when a new vertex (e.g., F’) comes around. 
For the sake of consistency, we do not want to add any pyramid 
implicitly rejected by the minimal cover we started out with, such as 
BCD. We call new covers satisfying this desideratum consistent. The 
appropriate building block for the new system of pyramids turns out 
to be faces. 

Definition: Let {xi, ---, xx} C R” be the set of vertices from the 
starting minimal cover, and let p be a point outside its convex hull. A 
new face is the convex hull of {p, x:,, --- ,x;,_,} for any sequence 1 = 
ly < +++ <in-1 SK such that: 


(¢) row rank of (x;, — p, --+,Xi,,, —p) =n — 1, and 
(ii) no xj (J ¥ 11, +++, ini), is in the convex hull of {p,xi,, ---, 
> ae 


Condition (i) ensures that the new face does not collapse to some 
<(n — 1) dimensional surface. It may be checked by counting the 
nonzero eigenvalues of the covariance matrix (see Section VI). Con- 
dition (ii) rules out pathologies such as face DE’ in the two-dimen- 
sional case pictured in Fig. 11. (AE’, of course, is legitimate.) 
Proposition 5: Let Fo be the set of old faces in the starting minimal 
cover, and N be the set of new faces. If S is any consistent minimal 
cover for the new convex hull whose set of faces is denoted by F,, then 

(t) Fis asubset of Fo U N, 
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OLD VERTICES 


@ £’ NEW VERTEX 


Fig. 11—The pathological triangle ADE’. 


(zi) Any two faces from F do not intersect at interior points (though 
they may met at boundaries), and 

(tii) F' is a maximal set satisfying (i) and (iz), i.e., any face not in F 
either is not in Fy U N or meets some face from F at interior points. 
Proof: Check that the faces from any minimal cover must satisfy (iz). 

Thus, assuming that we can enumerate all sets of faces satisfying (7) 
through (iii), we may proceed to reconstruct minimal covers from 
these sets, and Proposition 5 tells us that we would have captured all 
the consistent new minimal covers we want this way. Incidentally, 
Proposition 5 does not rule out the possibility that we may catch more 
than the new minimal covers or that some sets of faces may not be 
reconstructible into new minimal covers. These situations are dealt 
with by the converse to Proposition 5, which remains to be proven. 
Therefore, we should guard ourselves against the first possibility and 
not be surprised at the second. 

The construction of all sets satisfying (z) to (iii) can be done neatly 
along a downward-growing binary tree whose branches enumerate the 
sets. 

Example: Continuing the preceding example, it is clear that Po = {AB, 
AC, AD, BC, BD} and N = {EA, EB, EC, ED}. Next, we list pairs of 
faces from Fy U N that meet at interior points in Table III. We now 
grow our tree. The trunk of our tree contains those faces meeting no 
others. Beyond that, we split the tree into two branches each time we 
choose between a face and any one of the faces it meets. We extend 
each branch as far as possible as long as it does not contain pairs of 
intersecting faces. At the end of each branch we reconstruct a new 
minimal cover from the faces listed on the branch; the result is 
displayed in Fig. 12. Thus, the leftmost branch {AD, BC, BD, EA, EC, 
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Table IIl—Pairs of 
intersecting faces for 
typical problem 
Face Meets Face(s) 


AB ED 
AC EB, ED 
AD _ 
BC — 
BD _— 
EA — 
EB AC 
EC _ 
ED AB, AC 
AD 
BC 
BD 
EA 
EC 
AB SK ED 
ee EB EB 
{aso ACE, ABC {400 BCE, ABE ADE, BCE, BDE 


Fig. 12—Tree from new vertex E. 


AB, AC} lists one of the possible sets of faces that contains as many 
faces as possible without including two that intersect at interior points. 
It is easy to see that the branch is an enumeration of all faces from the 
triangulation {ABD, ACE, ABC}. The other two branches are ob- 
tained similarly. Using the evaluation techniques from Sections V, VI, 
and VIII, the triangulation reconstructed from the faces listed in the 
middle branch was chosen. 

The only step in this procedure, which is nontrivial in higher 
dimensions, is the tabulation of pairs of faces that intersect at interior 
points. Especially where there are many pairs of faces, making a pair- 
by-pair determination can be cumbersome. 


Notation: Let e; € R” be the vector with 0 in every component except 
the ith, where it is 1. 
Proposition 6: Let F = {x1, ---,Xn}, G= {yi, «++, yn} C R” be faces 
from Fo U N. Choose any j such that (x1 — Xn, +++, Xn-1 — Xn, ej) iS 
invertible, and let 

p’(F) = en (x1 — Xn, +++, Xn-1 — Xn, ej)’ E R”. 


(If F is a face, such / exists.) Similarly, define p’(G). Then F' and G 
have empty interior intersections if and only if either 
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(it) p'(F)x. < min{p’(F)yi, ---, p(F)y,} or p(F)x: > 
max{p’(F)yi, ---, p’(F)yn}, or 

(ii) p’(G)y. < min{p’(G)m, ---, p(G)x»} or p(G)yi > 
max{p’(G)xi, ---, p’(G)x,}. 
Proof: The clue is to recognize that p(F) is simply some vector 
perpendicular to F and p’(F)y is the unscaled projection of y onto - 
p(F). Clearly, two faces have disjoint interiors if and only if the two 
faces have disjoint projections along some direction perpendicular to 
one of them. 
Example: We leave the response time function h so that we may 
illustrate application of Proposition 6 in three dimensions. We are 
given six vertices in R®, and we wish to determine how the 20 resulting 
faces intersect in pairs. 


(el) 
“Qe e(3 


Table IV contains the results of the calculations according to Propo- 
sition 6. The last column of the table, where filled, lists the pairs of 
faces with disjoint interiors that have been identified. There are five 
such pairs. For example, the first row shows, using a projection 
perpendicular to ABC, that ABC and DEF do not meet, since 30 < 
min{80, 60, 69}. Note that A, B, and C have the same projection, 30, 
since the projection is perpendicular to ABC. Note also that BCD and 
BCE yield the same rows on Table IV. This is because B, C, D, and E 
lie on the same plane. By Proposition 6, the five other pairs of faces 
must meet at interior points. 

Because only six points were involved, each row in the table could 
only be used to separate two three-pointed faces. If more points were 
involved, more than one pair of faces could be found disjoint at once. 
For example, if we had two additional points G and H, and the first 
row read as in Table V, then the following pairs of faces may imme- 
diately be identified as disjoint: 


ABC, DEF ABH,DEF AHC, DEF 
ABC,EFG ABH,EFG £AHC,EFG 
ABC, DFG  ABH,DFG AHC,DFG 
ABC, DEG ABH,DEG  AHC,DEG,::: 


This method clearly beats having to test each pair separately. 
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Table |\V—Determining how 20 faces meet in pairs 
FaceF p(F)A p(F)B p(F)C p(F)D p(F)E p(F)F 


ABC 30.0 30.0 30.0 80.0 60.0 69.0 ABC, DEF 
ABD 17.5 17.5 5.0 17.5 —15.0 —6.0 ABD, CEF 
ABE 24.0 24.0 18.0 50.0 24.0 33.0 
ABF 22.2 22.2 14.4 41.0 13.2 22.2 
ACD 30.0 5.0 30.0 30.0 85.0 6.9 
ACE 30.0 60.0 30.0 140.0 30.0 69.0 ACE, BDF 
ACF 13.3 0.0 6.7 0.0 0.0 6.0 ACF, BDE 
ADE 22.1 12.9 14.3 22.1 22.1 21.9 ADE, BCF 
ADF 22.2 12.8 14.4 22.2 22.6 22.2 
AEF 22.2 13.2 14.4 23.0 22.2 22.2 
BCD 20.0 10.0 10.0 10.0 10.0 9.0 
BCE 20.0 10.0 10.0 10.0 10.0 9.0 
BCF 20.3 10.5 10.5 11.8 11.3 10.5 ADE, BCF 
BDE 20.0 10.0 10.0 10.0 10.0 9.0 
BDF 20.1 9.7 10.2 9.7 11.1 9.7 ACE, BDF 
BEF 20.4 11.4 10.8 14.0 11.4 11.4 
CDE 20.0 10.0 10.0 10.0 10.0 9.0 
CDF 20.3 9.9 10.5 10.5 11.9 10.5 
CEF 20.3 11.3 10.5 13.3 10.5 10.5 ABD, CEF 
DEF 23.0 14.0 16.0 27.0 27.0 27.0 ABC, DEF 


Table V—Separating many pairs of faces simultaneously 
FaceF p(F)A p(F)B p(F)C p(F)D p(F)E p(F)F p(F)G p(F)H 
ABC 30 30 30 80 60 69 70 25 


X. RECAPITULATION 

The cycle is now complete. We start out with some minimal cover— 
possibly consisting of a single pyramid—on the x-space of some func- 
tion y = f(x). The behavior of the function on the minimal cover may 
be explored by linear interpolation as discussed in Section II. If further 
information is subsequently found wanting, a new interpolant is cho- 
sen, and a new data point is empirically acquired. Using the technique 
from the preceding section, we are able to list all consistent ways to 
extend the starting minimal cover into a new minimal cover that 
incorporates the new interpolant. These new minimal covers should 
be evaluated both in terms of their faces and in terms of the axes of 
their pyramids. For either evaluation, the basic ingredient is the notion 
of steepest ascent. Through this notion the values of the function at 
the interpolants are taken into account. This distinguishes our ap- 
proach from other schemes where only the x-space gets examined. 
Eventually, a best new minimal cover is chosen, and we are ready, 
once more, to interpolate and explore the function. 


REFERENCES 


1. R. A. Becker, and J. M. Chambers, “Design and Implementation of the S System 
for Interactive Data Analysis,” Proc. Computer Software and Applications Chi- 


MULTIVARIATE FUNCTIONS 1485 


cago, Illinois (November 13-16, 1978), pp. 626-9. 

2. C. L. Lawson, “Generation of a Triangular Grid With Application to Contour 
Plotting,’ Tech. Memo. 299, Sect. 914, Jet Propulsion Lab., Calif. Inst. Tech. 
(February 1972). 

3. H. A. Akima, “A Method of Bivariate Interpolation and Smooth Surface Fitting for 
Irregularly Distributed Data Points,” ACM Trans. Math. Software, 4, No. 2 (June 
1978), pp. 148-59. 

. W. E. Milne et al., “Mathematics for Digital Computers,” Vol. 1: Multivariate 
Interpolation, Wright Air Development Center Tech. Rep. 57-556, Aeronautical 
Research Laboratory (February 1958). 

. T. W. Anderson, An Introduction of Multivariate Statistical Analysis, New York: 
John Wiley, 1974. 

. G. E. P. Box, W. G. Hunter, and J. S. Hunter, Statistics for Experimenters: An 
Introduction to Design, Data Analysis and Model Building, New York: John 
Wiley, 1978. 

7. G. Birkhoff and S. MacLane, A Survey of Modern Algebra, 4th ed., New York: 

McMillan Publishing Co., 1977. 


aN 


oO on 


1486 THE BELL SYSTEM TECHNICAL JOURNAL, SEPTEMBER 1982 


Copyright © 1982 American Telephone and Telegraph Company 
THE BELL SySTEM TECHNICAL JOURNAL 
Vol. 61, No. 7, September 1982 
Printed in U.S.A. 
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We analyze a traffic overflow system that consists of two groups of 
trunks, with waiting spaces for each group, and some overflow ca- 
pability from the primary to the secondary group. We consider the 
case in which the number of waiting spaces in the primary queue is 
large compared to the corresponding number in the secondary queue 
and to the number of trunks in the secondary group. The case of an 
infinite number of waiting spaces in the primary queue is also 
allowed. We contrast the approach presented with some previous 
approaches that are suitable when the number of waiting spaces in 
the primary queue is not comparatively large. As with previous 
approaches, the aim is to reduce the dimensions of the system of 
equations to be solved in order to calculate various steady-state 
quantities of interest. Our results include expressions for the loss 
probabilities, the probability of overflow from the primary to the 
secondary group, and the average waiting times in the queues. We 
also obtain the stability condition under which the results are valid 
when the number of waiting spaces in the primary queue is infinite. 


I. INTRODUCTION 


In this paper, a particular traffic overflow system with queueing is 
analyzed. The same system has been investigated previously where 
techniques were developed for reducing the dimensions of the system 
of equations to be solved in order to calculate various steady-state 
quantities of interest.’” This analysis is a considerable improvement 
over the earlier study for the case when the number of waiting spaces 
in the primary queue is large compared to the corresponding number 
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in the secondary queue and to the number of secondary trunks.’ A 
suitable analysis has already been made for the case when the number 
of waiting spaces in the secondary queue is large compared to the 
corresponding number in the primary queue and to the number of 
trunks in the two groups.” 

In cases of interest, the sparse system of linear equations for deter- 
mining the steady-state probabilities of the number of demands in the 
two groups may have very large dimensions. The reduced system of 
equations then has considerably fewer dimensions, but the system will 
be dense. Numerical results based on the solution of the original sparse 
system of equations by successive over-relaxation techniques, and on 
the solution of the reduced system of equations obtained earlier by 
Morrison,’ were presented by Kaufman, Seery, and Morrison.* The 
numerical values of the various steady-state quantities of interest 
obtained by the two procedures agree to many significant figures. 
Other procedures for solving the original system of equations are 
discussed in Refs. 1 and 4. 

The dimensions of the reduced system of equations obtained in this 
paper are independent of the number of waiting spaces in the primary 
queue. On the other hand, the dimensions of the reduced system of 
equations derived in Ref. 1 continue to increase as the number of 
waiting spaces in the primary queue increases. This is even more 
drastically the case for the original sparse system of equations. Con- 
sequently, the reduced system of equations that we derive is advan- 
tageous when the number of waiting spaces in the primary queue is 
large, and even more so when the number is infinite. 

The traffic overflow system considered consists of two groups, a 
primary and a secondary, with n, > 0 servers and gq, waiting spaces, 
which receive demands from independent Poisson sources S; with 
arrival rates A; > 0, k = 1 and 2, respectively, as shown in Fig. 1. The 
service times of the demands are independent and exponentially dis- 
tributed with mean service rate p > 0. If all nz servers in the secondary 
group are busy when a demand from S; arrives, the demand is queued 
if one of the g2 waiting spaces is available; otherwise it is lost, that is, 
blocked and cleared from the system. Demands waiting in the second- 
ary queue enter service, in some prescribed order, as servers in the 
secondary group become free. 

If all mn, servers in the primary group are busy when a demand from 
S, arrives, and there is a free server in the secondary group and no 
demands waiting in the secondary queue, the demand is served in the 
secondary. If there are no free servers, then the demand is queued in 
the primary queue, if one of the q; waiting spaces is available; otherwise 
it is lost. Previously, two different cases were considered for the 
treatment of demands waiting in the primary queue.” In case I, a 
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Fig. 1—Mean flow rates for an overflow system with queueing. 


demand waiting in the primary queue may enter service either in the 
primary group, when a server becomes free, or in the secondary group, 
if a server becomes free and there are no demands waiting in the 
secondary queue. In case II, no overflow is permitted from the primary 
queue, so that a demand in the primary queue must wait for a server 
in the primary group to become free. 

We consider only case I and use a different approach to analyze the 
overflow system. The new approach is preferable to the earlier ap- 
proach’ when q; is large compared to gz and ne, or even infinite. The 
alternate earlier approach’ is preferable when qe is large compared to 
qi, M4, and ne, or even infinite. 

Let p;; denote the steady-state probability that there are : demands 
in the primary group and j demands in the secondary group, either in 
service or waiting. These probabilities satisfy a set of generalized birth- 
and-death equations, which take the form of partial difference equa- 
tions connecting nearest neighboring states. The basic technique is to 
separate variables in regions away from certain boundaries of the state 
space, the elements of which are (1, 7). These regions are shown in Fig. 
2a. The analogous regions corresponding to the earlier analyses” are 
shown in Figs. 2b and 2c. The separation of variables leads to two sets 
of eigenvalue problems for the separation constant. The eigenvalues 
are roots of polynomial equations. The probabilities p;; are then 
represented in terms of the corresponding eigenfunctions. 
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The constant coefficients in these representations are determined 
from the boundary conditions and the normalization condition that 
the sum of the probabilities is unity. There are two sets of constants, 
corresponding to the representations of the probabilities p;; in the two 
shaded regions in Fig. 2a. In general, these constants have to be 
evaluated numerically. However, we do show that the set of constants 
corresponding to the representation in the primary queueing region 
may be expressed explicitly in terms of the other set of constants. This 
further reduction leads to a problem of fewer dimensions than the one 
obtained in the first analysis,’ regardless of the size of gi > 0. 

Various steady-state quantities of interest may be expressed in terms 
of the probabilities p;;. Among these are the loss (or blocking) proba- 
bilities, the probability of overflow from the primary to the secondary 
group, the probabilities that a demand is queued, and the average 
waiting times in the queues. These quantities may be expressed directly 
in terms of the constant coefficients occurring in the representations 
of the probabilities p,;, averting the need to calculate the p,;. This is a 
key advantage of the reduction in the dimensions of the problem. 

The solution is built up in stages. In Section II, we consider solutions 
to the birth-and-death equations in the region0 Sism-—1,0Sjs 
N2 + ge of state space. This corresponds to the case q, = 0, and was 
analyzed earlier as a special case.’ Its inclusion here is for the sake of 
completeness. In Section III, we examine the solution in the region of 
state space corresponding to queueing in the primary, and give a 
heuristic derivation of the stability condition when gi = ~. Section IV 
discusses the boundary and normalization conditions, while Section V 
is devoted to the calculation of various steady-state quantities of 
interest. In Section VI, we show how to achieve a further reduction in 
the dimensions of the problem by the introduction of a generating 
function. This reduction is obtained in another way in Section VII. 
Properties of the eigenfunctions that occur in the representations of 
the probabilities p,;; are given in Appendix A, the eigenvalues corre- 
sponding to the primary queueing region are discussed in Appendix B, 
and results pertaining to the generating function are derived in Appen- 
dix C. 


il. REPRESENTATION OUTSIDE THE PRIMARY QUEUEING REGION 
For convenience, we define 
h=m+q, ko = ng + qo. (1) 


Let pi (0 =i = ki, OS 7S kz) denote the steady-state probability that 
there are i demands in the primary, and j demands in the secondary. 
Define also the traffic intensities 


Qi=Ai/p, 2 = Ao/p. (2) 
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As usual, we let 6;; denote the Kronecker delta, i.e., 


Wel J bys Us 
w= {) fae ) 
Finally, we introduce the function 
_ jl, 120, 
w= {i 1<0. ) 


The probabilities p;; satisfy a set of generalized birth-and-death 
equations. For case I, it may be shown’ that 


[ai(1 — 8i%,Xj-ny) + ae(1 — dj2,) + min(Z, m1) + min(J, n2)] pi 
= (1 — Xi-ny-1Xnq—j-1) [Ai(1 — 810) Di-a, 
+ (1 — Sx,)min(j + 1, n2e)pi js] (5) 

+ (1 — §)0)[ar8in, Xn.-j + a2(1 — Xi-n,-1Xn,-/) ] Di j—-1 
+ (1 — ix.) (1 — Xin, Xn .y-1) mini + 1, 21) + NexXi-n, Sin. Dit1,), 
for0 =<isk,,0 <j] Ss ko. These equations were constructed to imply 
that 

py=0,m+1sishi, 0sjsn—1], (6) 
since there can be no queued demands in the primary when there is a 


free secondary server. 
The normalization condition is 


ny kg ky ko 
LP Dd pe 1. (7) 
i=0 7=0 t=n,+1 J=ny 


For 0 = iS nm — 1 and 0 Sj S hz, the variables in (5) may be 
separated,’ and there are solutions of the form a;8;, where 


(a, +i + p)ai = ai(1 — Sio)ai-1 + (i + Daisi, (8) 
for 0 =i<=n,-— 1, and 
[a2(1 — d,) + min(j, m2) — pl; 
= a2(1 — §j0)Bj-1 + (1 — S,)min(7 + 1, ne) Bj, (9) 


for 0 = 7 = ke, and p is a separation constant. The solution of (8) 
may be expressed in terms of Poisson-Charlier polynomials.*® We 
denote here the solution of (8) for which ao = 1 by s,(p, a1). The prop- 
erties of s;(A, a) which we will need in the analysis are contained in 
Appendix A. 

For 0 $7 = nz — 1, (9) becomes 


(a2 +7 — p)B; = ae(1 — bo) Bi-a + (J + DB, (10) 
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and we see that there are solutions £; proportional to s;(—p, a2) for 
0 <7] S ne. For ne =] S ho, we find that 


[a2(1 — dj.) + m2 — p] Bj = a2Bj-1 + no(1 — dj2,)Bj+1. (11) 
To find a representation for solutions to (11), we define’ 
1/2 
ne Qo +N p 
Vi(p) = (=) u(2=B=*) (12) 
a2 2Va2ne 
and 
&j(p) = Vx,-j() — Ve,-j-1(p), (13) 


where U;(x) denotes the /th Chebyshev polynomial of the second 
kind.’ The properties of these functions are discussed in Appendix A. 
However, we note here that Uo(x) = 1, U-1(x) = 0, and hence 
oz,(0) = 1. From (11), and (134), it follows that 8; is proportional to 
(p) for n2-1s7 Ske. 

Thus, as a basic solution to (9), we take 


igen s;(—p, Q2)on,(p), 0 =J = ne, 
Bi = a a2)$;(p), nz — 1 <j = ko, ~ 


where 
Sn.-1(—p, A2)n,(P) = Sn,(—p, A2)hn.-1(p). (15) 


With the help of (13), (128), (129), and (133), (15) may be expressed in 
the form, 


pLSn,(1 — Pp, a2) Vq,(p) _ Sn.-1(1 — Pp; a2) V 7,-1(p) J = 0. (16) 


The expression in the square brackets is a polynomial in p of degree 
ka. It was shown’ that its zeros are positive and distinct, and we denote 
them by pm, m = 1, 2, «++, ke. We adopt also the convention po = 0. 
The calculation of the {p,} for m > 0 is related to an eigenvalue 
problem for a symmetric tridiagonal matrix.’ 

For the moment, we consider q: = 0, so that k; = nm. This corresponds 
to the case of queueing in the secondary only, which has been analyzed 
as a special case.’ In terms of the separated solutions to (5) for 
0<1=n, — 1, arepresentation of the solution for0<ism,0Sjs ke 
is 


Re 
y AnSi(Pm, a1)8;(—pm, A2)Gn,(Pm), 0 =J = ne, 


m=0 
Peay oe (17) 
Y dnSilPm, A1)$n(—Pm, A2)bj(om), nmeSjs ko. 


m=0 
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The constants {d,,} are determined only to within a multiplicative 
factor by the boundary conditions at i = m, 0 = J S kz, since the 
boundary condition at 7 = nz may be shown to be redundant.’ The 
multiplicative factor is determined by the normalization condition (7). 
In general, the constants {d,,} have to be calculated numerically. 

For qi > 0, the representation (17) may still be used for 0 =i sm, 
0 <j she, but a different representation must be found for n} + 1 
i = ki, no Sj S kp. The earlier approach’ was to find separated 
solutions to (5) in the region, +15 i1Ski,n2+1Sj)S ho, 1.e., away 
from the boundary at j = nz. This led to a representation of pj; for 
m Stsk, n2SJ Ske. Solutions were found that match those in (17) 
for t = nj, ne =] S ke, and an additional q: solutions were found that 
vanish at i = n,. By the superposition of a suitable linear combination 
of these g; solutions, the boundary conditions at 7 = n2,m+1Sis 
k, could be satisfied, without disturbing the matching of the represen- 
tations at i = m1, Ne <j = ke. The constants {d,,} are still available to 
satisfy the boundary conditions at 7 = m, 0 <j S ke, and again the 
condition at 7 = n2 is redundant. 

Since the number of additional constants to be determined in the 
above approach is q, this procedure is not suitable if q: is large. In the 
next section, we investigate an alternate approach that is suitable if qi 
is large or even infinite. 


lil. REPRESENTATION IN THE PRIMARY QUEUEING REGION 


We now assume that gi = 2 and q2 = 1, although these restrictions 
may be relaxed. For n} + 1 Sisk; — 1 and n2 Sj S ke, the variables 
in (5) may be separated and there are solutions to the partial difference 
equations of the form 


a fF. n=ix ky, no <j = ke, 


where 
| (m - “a — 6) + ao(1 — dx,) + no(1 — §n,0) 7 


= a2(1 — Sin.) fj-1 + n(1— Sin) fi+1, ne =) =k2, (18) 


and o is a separation constant. Since a; ¥ 0, and we are interested in 
nontrivial solutions, o ¥ 0. In particular, for ne + 1 <7 S ke, we have 


[a2(1 — Sx.) + ne — w)fj = Aefj-a + no(1 — Se,)fi+1, (19) 


where 


ne (2 = m)a ear (20) 
oO 
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We see from (134) that a solution which satisfies the boundary condi- 
tion at 7 = ke is 


= fro), Ne SJ Ske, (21) 


with ¢,; defined as in (13). 
It remains to satisfy the boundary condition at 7 = nz, namely, 


[a2 + no(1 — 0) — wl fn, = Nefng+1- (22) 
With the help of (21) and (13), we obtain 
N21 — o)dn,(w) + (a2 — w)[Vo,(w) — Vo,-1(w)] 
— nf[Vq,-1(w) — Vq,-2(@)] = 0. (23) 
This reduces with the help of (20) and (133) to 
(1 — o)H(o) = 0, (24) 


where 
H(o) = (x, — Sy) wr N2hn,(w), (25) 


with w defined as in (20). Hence, o = 1 or H(o) = 0. 

Now W(w) is a polynomial of degree J in w. Hence, o?*'H(o) is a 
polynomial of degree 2q2 + 1 in o, so that H(c) = 0 has 2q2 + 1 roots. 
In Appendix B, we show that these roots are positive and distinct, and 
that at least g2 of them lie between 0 and 1, and at least g2 of them are 
greater than 1. More precisely, define 


= = Wq,-1(0) 
Ai=m+ ml ce . (26) 


Then, if ai < Ai, g2 + 1 roots lie between 0 and 1, and qe roots exceed 
1. If a, > Ai, then gz roots lie between 0 and 1, and q2 + 1 roots exceed 
1. Further, if a; = A;, then H(1) = 0. We denote the roots of H(o) = 0 
by o,,r=1, -+-, 2q2+ 1, withO < 0, < --- < 629,41. For convenience, 
we define oo = 1. Then og, < 1 and 67,42 > 1, and 69,41 < 1 if a; < Aj, 
and 07,41 > 1 if a; > A,. Note that o-, r= 0, ---, 2g2 + 1 are distinct 
if a; # A;. On the other hand, oo = 04,11 if a1 = Ai. 

For q: < , we assume for simplicity that a; * A. We then have 
2q2 + 2 distinct roots, and it follows that we may represent the 
probabilities in the form 


2qot+1 


Di = bs eo; (wr), ny, = l = hi, N2 = 7 = ko, (27) 


r= 
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where 


‘es (2 a m)(a ~ G,) 
Or 


and the constants {e,} are to be determined. Consistency of the 
representations (27) and (17) at i = m requires that 


2qo+1 kg 
y €rh;(w,) = » AmSn,(Pms Q1)S$n,(—Pm, a2);(Pm), (28) 


for ne Sj Sho. 

For gi = ©, we assume that a; < A:; the reason for this will be 
apparent shortly. We use the representation (27), but the normalization 
condition (7) implies that eo = 0 and e, = 0, r = q2 + 2, +--+, 2q2 +1 
since 069 = 1 and o, > 1, r= qo + 2, --- , 2g2 + 1. We thus have g2 + 1 
conditions to replace the gz + 1 boundary conditions lost on removal 
of the boundary at i = ki. If a; > Ai, then o,,.1 > 1, and we would 
have to take e,,41 = 0 also. However, this would leave only g2 constants 
er, r= 1, -++, go to satisfy the ge + 1 conditions (28), which is 
insufficient. 

The stability condition ai < Ai, for qi = ©, may be derived heurist- 
ically as follows. Consider the situation when there are demands 
waiting in the primary queue. It follows that all nz servers in the 
secondary are busy. As the point of instability is reached, the primary 
queue will never be empty, and hence acts as an infinite source for the 
secondary servers. Demands from this source enter the secondary only 
when a server becomes available and there are no demands waiting in 
the secondary queue. Since the secondary system is a loss system, it is 
ergodic. . 

Let P;, 7 = ne, +++, ke, be the steady-state probability that there are 
j demands in the secondary. Then the rate at which demands in the 
primary queue are served in the secondary is n2u4P,,, and the rate at 
which they are served in the primary is mip. Since Ai = aup, it follows 
that the stability condition is 


a; <1 + NeP»,. (29) 


But, as for the system without the infinite source,” 


J—N2 
P; = (=) Pn, ta <j<he. (30) 
Ne 
With the infinite source present, P; = 0, 7 = 0, --+, m2 — 1, and P,, is 
determined by 

ko 
¥ P; = 1. (31) 

J=N2 
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From (30), (31), and (136), we obtain 


- = W,,-1(0) 
P,, = E Fa | (32) 


From (26), (29), and (32), it follows that the stability condition for 
n= coisa; <A). 


1V. BOUNDARY CONDITIONS 


The constants {d,} and {e-} introduced in (17) and (27) must be 
chosen so as to satisfy the boundary conditions at i = n, and i = ky. 
The boundary conditions at i = k; imply that 


[ao(1 — djx,) + mi + Ne] De,.j 
= @1Pr,-1,j + G2(1 — S)n,)Pe,,j-1 + Noll — Sjx,)px,j+1 (33) 
for n2 SJ = ko. Atl = ny, we have 
(a) + a2 +m +7)Dn,,j 
= Q1Pn,-1,j + (Qi + @2)(1 — Sj0)pn,j-1 + (J + Upn,j+i (84) 
for0 <j sn — 1, 
(a + a2 + m + N2)Dryn, 
= A1Pn,-1,n. + (1 + A2)Pnyny—-1 + N2Pnyny+1 + (M1 + N2)Pnj+1n,, (35) 
and 
[ai + a2(1 — Sx,) + m1 + ne] Dn,,; 
= QPn,-1,j + A2Pn,j-1 + MPnj+1,7 + NAL — Ojx,)Pn,j+1, (36) 


forn2+1sjs ko. 
If we substitute (17) in (34), we find, on reduction with the help of 
(125), (128), and (129), that 


ke 
»> Amon,(Pm){ PmSn,(1 + Pm, a1)s(—pm, 2) 
m=0 


+ QiSn,(Pm, G1)8(—-1 — pm, @2)] = 0, (37) 


for 0 = 7 = nz — 1. Similarly, if we substitute from (17) and (27), (36) 
implies, after reduction, that 

ho 

»> AmPmSn,(1 + Pm, Q1)S$n,(—pm, A2);(Pm) 


m=] 
2qo+1 
— ¥ eno, — ai)d(wr) = 0, (38) 
r=0 
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for n2 + 1 <7 S ke, and (33) implies, with the help of the fact that f; = 
;(w,) satisfies (18) for 0 = r <= 2q2 + 1, that 
2qotl 
Yeo [(n1 + N2djn,)Or — Ai]ow,) = 0, (39) 
r=0 
for no =] S ke. Finally, (17), (27), (15), (125), (128), and (129) in (35) 
imply that 


ke 
> Amn,( Pm) PmSn, (1 + Pm, Q1)S$n,(—Pm; a2) 
m=0 


+ @18n,(Pm, @1)Sn,(—1 — Pm, a2) | 
2qo+1 


— (n; + ne) 2 €,Orgn, (wr) = 0. (40) 


We will now show that the boundary condition at i = n, Jj = nz is 
redundant and may hence be dropped. Summing (39) from 7 = nz to 
ke, we find that 


2qot1 
Y eo? [(n190, — ay) Vq,(wr) + 120;bn,(w,)] = 0. (41) 
r=0 
For r = 1, 2, «++, 2q2 + 1, it is true that 
a 
H(o,) = (n, - 2 )¥9 (0) + negn,(wr) = 0, (42) 
and oo = 1 and wo = 0, so that (41) and (26) imply that 
€o(A1 — a1) = 0, 
that is, 
eo = 0, (43) 


since a; # A, by assumption. The redundancy of the boundary condi- 
tion (40) may then be verified by summing (37) from 7 = 0 to nz — 1, 
(38) from 7 = n2 + 1 to ke, and (28) from j = nz to ke, and by using the 
facts that p, satisfies (16) for m = 0, 1, 2, --+, ke, and eo = 0, together 
with the recurrence relations found in Appendix A. 

Consider (39) for 7 = m2. We note that ¢,,(w,) is nonzero; otherwise 
(22) with f; = $(w-) would imply that ¢,,+:(w,) = 0, and consequently 
that ¢;(w,) = 0, in contradiction to the fact that ¢;,(w-) = 1. Further, 
since the roots of H(o) are distinct, it follows that not all of the square- 
bracketed terms are zero. Hence, the equation is nonvacuous. Since 
summation of (39) from 7 = nz to ke implies eo = 0, if we set eo = 0 in 
(38) and (39), then (39) for 7 = nz is redundant and accordingly may be 
dropped. We remark that even for a; = Aj, we are still free to take 
€o = 0 since 6y,41 = 1 in that case. As before, the boundary conditions 
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at J = ng (it = nm and i = kj) are redundant. Therefore, it follows that 
the constants {d,,} and {e,} are determined by the boundary conditions 
at i = n, for 0 Sj S m — 1 and m+ 1 5/ S ks, the consistency 
relationships at i = n, ne Sj S ke, the boundary conditions at i = ky, 
no + 17 S kz if qi is finite, and the normalization condition (7). 
We summarize the results of the last three sections as follows: 
he 
>. AnSi(Pm; Q1)S;(—pm, A2z)hn,(Pm), 0 <j =n, 
py = 4" (44) 


he 
Y AmSi( Pm, @1)$n.(—Pm, A2)6j(Pm), ng =Js ko, 


m=0 


for 0 <i <n, and, 
2qot1 


Py= > €or ™;(wr), Nn2=J sk, (45) 
r=1 


for n; Si = ki, (qi < ©), where 
Pm[Sno(1 — Pm; A2)Vq.(Pm) — Snq-1(1 — Pm, A2)Vq,-1(Om)] = 0, (46) 
for m = 0,1, 2, ---, Re and 
H(o,) = 0, r=1,2,3,---,2q2+1 (47) 
with w, = (= - ms)(a ~ Gy). 
The ke + 2qo + 2 constants {d,,} and {e,-} are determined, with eo = 0, 
by (28) and (37) to (39) only to within a multiplicative constant, which 


is determined by the normalization condition. From (44), (46), (13), 
(130), it follows that 


ke 
»y Di = dosi(0, a1)[Sn,(1, a2) V,,(0) Sn.-1(1, a2) V¥,,-1(0)], (48) 
j=0 


for 0 = i <= n,, so that (7), (13), (45), and (130) imply that 
dos, (1, a1) [Sn,(1, a2) V7, (0) = Sn.-1(], a2) V4,-1(0)] 


2qgo+1 


+ ¥ eFy,(o,)¥q,(wr) = 1, (49) 
r=1 


where 


q-1 
= 1 — &%)/(1 - 8), 1, 
[=0 q; g =]. 
We point out that the special case gi = 1, g2 = 0 has previously been 
analyzed.’ In the primary queue region, solutions of the form 





i—ny; 
a . 
Din, = (— re -) Pny,ng) n=is hi, (51) 
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where 
ke 
Pn,,nyg = x, AmSn,( Pm, @1)Sn,(—Pm, Q2)hn,(Pm), (52) 


were found. We note that since kz = ne for gz = 0, dn,(p) = ox,(o) = 1, 
above. For gi = ©, the stability condition was found to be a; < m1 + no. 
It is readily verified that the representations (44) and (45), and the 
stability condition ai < A1, produce the same result. 

For gi = © and qg2 = 1, we must set e, = 0 for r = go + 2,---, 
2q2 + 1, as well as eo = 0, as discussed in Section III. The remaining 
ko + Qo + 2 constants {d,,} and {e,} are then determined by (28), (37), 
and (38) to within a multiplicative constant, which is determined by 
the normalization condition. But, from (50), 

g 


Y..(%) = ba Ty, (g) = 1-8)’ 0Osé<1. (53) 


Hence, for gi = ~, the normalization condition (49) is 


doSn,(1, @1)[Sn(1, A2)¥o,(0) — Sn.-1(1, a2) ¥o.-1(0)] 





tl eg, v 
+L Gasp Valo = 1. (64) 


V. SOME STEADY-STATE QUANTITIES 


We now proceed with the calculation of various steady-state quan- 
tities of interest. These quantities are shown in Fig. 1, which depicts 
mean flow rates. The loss probabilities are, for the primary and 
secondary, respectively, 

hy hy 


In= DY Pris L2 = ¥) Pikes (55) 
J=Ny i=0 


while the probabilities that a demand from the primary, or secondary, 
source is queued on arrival are 


ky-1 ke ky ko-1l 
Qi= Y DX p#,(m2=1), @=) Y py, (q2=1). (56) 
i=n, j=ng i=0 j=ng 


The probability that a demand arriving from the primary source 
overflows immediately is 


Ip. = Dy Pn,,j- (57) 


Since the mean service rate is » for each server, the mean departure 
rate from the primary queue to the primary servers is 


Se 
Ru=nyp Y YY vpy, (m=, (58) 
i=n,+1 J=nz 
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while the mean departure rate from the secondary queue is 


ky ko 
Re=neyY > py, (q2=1). (59) 
t=0 J=np+1 


The mean rate at which queued demands in the primary overflow to 
the secondary servers is 


Riz = nop 5 Pings (qi = 1). (60) 


t=n,+1 


The average queue populations are, for the primary and secondary, 
respectively, 


ky 2 
w= SY G-mpy, (=D, (61) 
t=n,4+1 j=n, 
and 
Boh 
v=) Y (J-ne)py, (g2 = 1). (62) 
t=0 J=np+1 


Also, the average number of demands in service pe the two groups are 


ny ko ky 
=) > pytm(il—- 84,0) > 5 Di; (63) 
t=0 j=0 i=n,+1 j=nyg 
and 
ny, no-1 ky ko 
X2= >) Y jpytne yd Dd dy. (64) 
i=0 j=0 i=0 j=ny 


An application of Little’s theorem® to the primary and secondary 
queues shows that the average waiting times of the queued demands 
in the primary and a are given by 


Wi = >1 W2 =— >1 65 

i= wo »(m 21), 2 a »(qz= 1), (65) 

respectively, independently of the service order within each queue. 

Also, if we apply Little’s theorem to the primary and secondary groups 
of servers, we obtain 


Ai(1 — Li — Tia) — Rig = pX1i, Ao(1 — Le) + Avie + Riz = pXe, (66) 


since the mean service rate is p. 

The steady-state quantities of interest may be expressed directly in 
terms of the constants {d,,} and {e,}, with the help of the representa- 
tions (44) and (45). From (55), with the help of (13), we find that 


2qgot+1 


= Vo eoP¥,,(w,), (67) 
r=] 
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and with the help of (130) and the fact that ¢z,(p) = 1, we find that 
Ro 2gotl1 
Le = »y AmSn(1 + Om; Q1)S$n,(—Pm, 2) + y; e-I’,,(o,), (68) 
0 r=1 


where I°,(&) is as defined in (50). Similarly, it follows from (57) and 
(1380) that 


ho 
Te = »> AmSn,( Pm, @1)S$n,-1(1 — Pm, 2)bn,(Pm). (69) 


m=0 


Next, we find from (56), (50), and (13) that 


2qot1 e, 
Q= » (2) Tg, (6-) ¥q,(wr). (70) 
r=1 r 
Also, (58), (50), and (13) imply that 
2qotl 
Ru=mp Y eTg,(o)¥e(or), (71) 
r=1 
while from (60) and (50) it follows that 
2qotl 
Ry»= Nop > e-1'g, (Gr) dn, (wr). (72) 
r=1 


It may be shown directly from (5) that 
Ry + Riz = AiQ,, (73) 


and, using (42), it is readily verified that this is consistent with the 
representations (70) to (72). 
At this point, we define 


ky 
n= py, nmsjske. (74) 
i=0 
If we sum on 7 in (5), we obtain 
[ao(1 = Sin.) + Nne]r; = Aerj-1 + n2(1 = jn) Tj+1, (75) 
for ng + 17 S ko. It follows that 
Ne; = AeP;-1, n+1sjsko. (76) 


Hence, since Lz = r;,, from (55) and (74), we find that 


ko-j 
n= (2) lo, m<j<ho. (77) 


Then, it follows from (50) and (59) that 


Ro = raPa(@) Le (78) 
2 
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Also, from (56) and (77), we deduce that 
Qs = r=) ta. (79) 
a2 


These representations satisfy Ro: = \2Q2. We remark that this latter 
result, as well as (73), hold since, in the steady state, the average 
arrival and departure rates are equal, for both the primary and sec- 
ondary queues. 

From (61) and (13), we find that 


2qgot+l 
Vi= ¥ e-Ag,(or)Vo,(wr), (80) 
r=1 
where 
_ we gti séll—( (a+ Dé? + gg" V—-s’, §A1, 


We note in passing that 
T,(é) = gé7"" + (1 — €)AQ(E), (82) 
a formula that may be numerically useful. From (62), (74), (77), and 


(81), it follows that 
q2 
Vo = (=) Ao( 2) (83) 
a2 ne 


Finally, from (63), with the help of (7), (48), and (131), we find that 
X = M — doSn,-1(2, i)[Sn,(1, A2)¥q,(0) — Sn.-1(1, A2)Vq,-1(0)], (84) 
and, from (64), with the help of (7), (17), (180), and (131), 


ho 
X2 = Nz — x, AmSn,(1 + Pm, A1)8n.-1(2 — Pm, A2)on.(Pm). (85) 
We remark that in view of (67) to (69), (72), (84), and (85), the relations 
presented in (66) provide a useful numerical check. 

We point out that the results of this section up to this point are 
valid for gi < «. We recall that for gi = ~, e, = 0 for r = qo + 2, +++, 
2q2 + 1, since o, > 1 for these values of r. In addition, the stability 
condition a; < Ai, with A, defined in (26), is necessary. Under these 
assumptions, since 0 < 0, < 1 for r=1, ---,q2 +1, we find, analogously 
to (67), that 


Ty | q:=c0 = 0, (86) 


as expected. Also, with the help of (53), we obtain, analogously to (68) 
and (70) to (72), 
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qot1 


€,0y 
L2| q-0 = pa AmSn,(1 + Pm, A1)S$n,(—Pm, A2) + 2 er, (87) 
and 
qotl e; 
Q1| ¢,=0 = x =o): WVo,(wr), (88) 
gotl 
Rulqeo = rap Y, Valeo), (89) 
r=1 (l- Or) 
and 
qo+1 
Ri2| ¢q,=-0 os Nop 2. sey 5 ———- $n, (wr). (90) 
Similarly, from (81), we have 
g 
A.(€) = lim A =—_—__.. , 0<é< 1. 91 
(g) bial a (&) io g (91) 
Hence, corresponding to (80), we obtain 
go+l €;0r 
Vilqeo = YY ——ay Pe, (wr). (92) 


m1 (1—o,) 


The expressions for liz, Rez, Q2, V2, Xi, and Xe, given by (69), (78), 
(79), and (83) to (85), still hold. Of course, the constants {d,,.} and 
{€m}, which occur in these expressions, (87) to (90) and (92), are those 
pertaining to gi = ©, and are determined in the manner discussed at 
the end of Section IV. 


Vi. ELIMINATION OF ONE SET OF CONSTANTS 


We will now show how to determine the constants {e,} in the 
representation (27) of the probabilities p,; for n; SiS ki, ne Sj S he, 
in terms of the constants {d,,} in the representation (17) of the 
probabilities p;; forO =i=m,0S7 Ske. Form Si<hi,nsjs ke 
we have, from (5), 


[ai + ae(1 — 8%,) + mi + ne] Di 
= 41 Pi-1,; + [(Q1 + G2) bin, Sin, + Ao(1 — Syn.) | Dij-1 
+ (1, + n26jn,) Pisi,j + N21 — Sj.) Di jt. (98) 


We consider this equation for n; = i < », and introduce the generating 
function 


Gj(z) = piz'™. (94) 


The series will be convergent for sufficiently small |2z]. 
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We consider z ¥ 0 and define 
sa(1= a(% = a). (95) 
Then, from (93) to (95), it follows that 
| at — Sp.) + m(1 - : in] _ |e 
— a2(1 — bjn,)Gj-1(z) — neo(1 — 852.) Gi+1(2) 
= M1 Pn,-1,7 + (A1 + G2)8jn,Pn,,j-1 — : (21 + N28jn,)Pn,,j, (96) 


for n2 <j = ke. We now define G!(z) for ne = 1 <= kz as the solutions 
of the equations 


at — 5j,) + m(1 - : bn) - Jarre 
1 
— a2(1 — 8n,)GO,(z) — na(1 — §)2,)EO\(z) = - 5 bit (97) 


for ne =] = ko. Then, from (96) and (97), we have 


ko 
G;(z) = p> (1 DPn,u — @12Pn,-1,1)GS(z) 
=No 
+ [n2Pnyng — (Q1 + G2)ZPnyny-11GS(z). (98) 
We define 
a2\" n 
T(z) = (2) tient _ ¥ina®) (99) 


where V;(p) is as defined in (12). It is shown in Appendix C that the 
solution of (97) is 


(=)~ 0 

my) 10=2Hd2) 

G}(z) = (100) 
@)" THz)gi(0) ae 
az) (l1—2)Hjzy 


for n2 = 1 = kez, where ¢;(p) and H(o) are as defined in (13) and (25), 
respectively, with w as defined in (20). We know that H(o,) = 0, r = 
1, -++, 2g2 + 1, with 0 < 0; < +++ <09,41. As discussed in Section III, 
Og, < 1, 69,42 > 1, and 09,41 < 1 if a: < Aj, and 09,41 > 1 if ai > Ai, 
where Aj is given by (26). We assume that a; ¥ Aj, so that 04,4: ¥ 1. 
Now, from (12), (13), (20), (25), (95), and (99), z"H(1/z), 2**%@;(&) 
and z/-”;(z) are polynomials in z of degrees 2g2 + 1, 2(k2 — j) and 
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2(j — na), respectively. It follows from (100) that 2G (z) > 0Oasz—> 
o, for ne <j Ske, no =1< ke. Hence, from (98), we have G;(z) > 0 as 
z— ©, for ne =/j S ko. It follows that we may expand G;(z) in partial 
fractions in the form 

2qot+1 


. = _ 8 <7 
G;(z) x (= Won’ nesjsko, (101) 
where oo = 1, and 
gir = — [(z — 1/0,-)G;(z)]. (102) 


From (93), for nm; = 1 < ki, (94), and (101), we deduce that 
2qotl 


py=—- Y of g,, msish, mSjsko. (103) 
r=0 


We proceed with the explicit calculation of gj. Let 
gic lim [(z — 1/0-)G{(z)]. (104) 


Then, from (98), (102), and (104), we obtain 
ho 

Or 8jr = > (110;Dn,,1 ~ M1pn-apEy 
=n 


+ [2267 Dn,,no = (a1 “ A2) Drymy1E (105) 


As we noted in Section IV, ¢n,(w-) ¥ 0. It is shown in Appendix C 
that 


oy (w,) 
on,(@r) ; 


with w, given by (47). Hence, from (100), (104), and (106), since oo = 1, 
we obtain 





T;(1/o,) = ne Sj Ske, (106) 





w (=)~ $1(0)6;(0) __ 4, (0) 


H a2) n,(0)H() HQ) (07) 


from (137). We note that H(1) ¥ 0, since we have assumed that a; # 
A.. We also obtain 


l—n 
wm _(m) © elor)dj(or) oe 
ie (z) mi-oiGiéol “8 8 eee ee 


where the prime denotes derivative. 
From (27), (103), (105), and (107), since oo = 1, it follows that 
hg 
H(1)eo = _ (11 Pn,J = Q1Pn,-1,) + N2DPny ny — (ai + Q2) Dn, np-1- (109) 


l=ny 
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If we substitute for pz, and Pn,-17, M2 = 1S ke, andpn,n,-1 from (17), 
and sum on J, and use (13) and (128), we obtain 


Re 
H(1)eo = > Am[PmSn,-1(1 + Pm; Q1)$n,(—Pm, a2) V 4,(pm) 
m=0 


+ N2Sn,(Pm, A1)Sn,(—Pm, A2)n,(Pm) 
— (Q1 + A2)Sn,(Pm; A1)Sn.-1(—Pm, A2)n,(Pm)]. (110) 
After reduction with the help of (46) and some recurrence relations, 
we deduce from (110) that 
Ro 
H(1)eo = — SY dnon,(pm)[PmSn,(1 + Pm, A1)Sn,-1(1 — Pm, a2) 
m=0 . 


+ Q18n,(Om, @1)Sny-1(—Pm, Az). (111) 


But, summation of (37) from 7 = 0 to n2 — 1 shows that the term on 
the right-hand side of (111) is zero, so that 

H(1)eo = 0. (112) 
Since H(1) # 0 for a, ¥ Aj, it follows that eo = 0, as was shown in 


Section IV. . 
Next, from (27), (103), (105), and (108), we deduce that 


o-(1 > or) H’ (Gr) dn, (wr) er 
kg Ne I-ng 
= VY (apni — msorPn (2) oi(wr) 
l=nq a2 
+ [ (ai + Q2) Pnyn.-1 — N26 Pn, no ]bn,(Wr), (113) 


for r = 1, +--+, 2q2 + 1. If we substitute for p,,,, and Pn,-1,, ne S 1 


<= ke, and Pn,,n,-1 from (17), we obtain 


Re 
o-(1 = o,)H"(o,)e, = »y Amla1Sn,—-1(Pm, ai) _ 1107Sn,(O ms a1) | 
m=0 


he l—ng ko 
*Sn.(—Pm; a2) », (=) $1(Pm)1(w,) /dn, (Wr) + a AmSn, (Pm; a1) 


l=ng 
«(a1 + G2)Sn,-1(—Pm, 2) — N20rSn,(—Pm, G2) |on,(Om), (114) 


for r= 1, ---, 2qg2 + 1. The sums on J are given explicitly by (144), or 
by (145) if pm = w,. 

If we set o, = 1 in the right-hand side of (114), so that w, = 0 from 
(47), it is found after reduction that the expression reduces to —H(1)eo, 
as given by (111), and hence is zero, from (112). It follows that 
(1 — o-) may be factored from both sides of (114). Consequently, we 
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may consider the limiting case a; — Aj, and obtain a finite expression 
for é,+1, even though 04,41 > 1 as ai— A1. 

Now that we have determined the constants {e,} in terms of the 
constants {d,,}, it remains to determine the latter. These are deter- 
mined from (37) for 0 <7 S ne — 1, (39) for ne + 1</7 S ke and (49). 
The redundancy of (39) for 7 = nz was discussed in Section IV, and 
follows from the fact that H(1)eo = 0. Thus, the constants {d,,.} are 
determined from the boundary conditions at i = m,0<j <= ne —1 and 
at t = ki, no + 17 S ke, and from the normalization condition, the 
boundary condition at i = ki, 7 = mz being redundant. The constants 
{d,,} have to be calculated numerically. 


Vil. AN ALTERNATE DERIVATION 


In this final section, we will give another derivation of the expressions 
for e@o9 and e,, r = 1, -++, 2ge + 1, given in (109) and (113). This 
derivation is more direct, but somewhat obscure, and was not evident 
until the results had been established with the help of the generating 
function approach. 

We make use of the representation (27) for the probabilities p;,, 
which holds for nm, = i S ki and ne S J S ke, and the boundary 
conditions at i = mi, ne SJ S ko, as given by (35) and (36). In particular, 
from (27) we have 

2gotl 


Puyi= DY eroj(wr), msjsko. (115) 


r= 


Next, from (27) and (36), with the help of (47) and (134), we obtain 


2got+1 e; 
Dn-uy= Y; g(wr), MNet1lsjsko. (116) 
r=0 r 
Also, from (27) and (35), with the help of (47) and (140), we find that 
2go+1 
@1 Pn,—-1,nq = 3 (a, + Q2) Pn, ,np-1 =a > = hn, (Wr). (117) 


From (115) to (117), it follows that 


ko no J-n2 
Y, (1 Pn,-1,7 — M10mPn,,j) (=) j(@m) 
jany mA 


+ [(a1 + G2) Pny,no-1 = N20m Pn, ny \bn.(Wm) 


Qgotl ay ko no J—n2 
= On x e| (= - ms) i, (=) $j (Wm); (wr) 


~ nan (om) bu (0) | (118) 
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Now, from (47), 





(w, — &m) = (Om — G7) (= = m). (119) 


But, for a; * Ai, we know that o-, ¥ on for r ¥ m. Hence, if we set 
p = Wm in (143), and make use of (119) and (142), we obtain 


a ke Ne j-n 
(= - ms) 2, (=) $j (@m) oj (wr) 


= N2hn(Wm)on,(wr), r a m, r, m= 0, Pe § 2q2 + dy (120) 


This implies that all the terms on the right-hand side of equation 
(118), except the ones corresponding to r = m, are zero. 
Next, from (13), (20), (25), and (133), it follows that 


(1 — 0) H(0) = a2$n,-1(w) — N2don,(w), (121) 


and 


d 
5 [(1 — 0) H(o)] 


= (n, - “) [aepn,-1(W) — Neopn,(w)] — Nadn,(w). (122) 


Hence, from (145) we obtain 


ko | J—~Nn 
(3 = m) ») (2) [di (wr) P — nelon(wr)? 


r J=Nng 2 
= orton [a- ovHt)| ,  r=0,+++,2g2+1. (123) 


But, oo = 1 and wo = 0, and H(o,) = 0, r= 1, «++, 2g2 + 1, so that 


d _ | -HA(1), r=0 
(Zta-cntor} =e ren, Palace. 


The expressions in (109) and (113) for eo and e,, r= 1, ---, 2q2 + 1, 
respectively, follow directly from (118), if we set m = 0 and m = 
1, -+-, 2q2 + 1, respectively, and make use of (120), (123), (124), and 
(137). 


APPENDIX A 
Properties of the Eigenfunctions 
We define s;(A, a) by the recurrence relation 
(a +i +A)s;(A, a) 
= a(1 — dio) si-1(A, a) + (0 + 1)si+1(A, @); So(A, a) = 1, (125) 
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for i = 0, 1, ---. Thus, s,(A, a) is a polynomial of degree n in both A 
and a, and it may be related to a Poisson-Charlier polynomial.*® 
However, we will give here the properties of s,(A, a) that we will need. 
An explicit formula is’ 


ae 


Gan? (126) 
where 
(A)o = 1, (A) =AA +1) -+- A+tr— 1D), r=1,2,---. (127) 
It was also shown’ that 
(t + 1)siai(A, a) = as;(A, a) + As,(A + 1, a), (128) 
and 
si(A, a) = si(A + 1, a) — (1 — 8i0)si-1(A + 1, a). (129) 
From (129), it follows that 
2 si(A, a) = s,(A + 1, a), (130) 


and, from (128) and (130), we deduce that 


Stns One) = sea: (131) 


t=0 


We now turn our attention to the Chebyshev polynomials of the 
second kind,’ U;(x). They may be defined by the recurrence relation 


2xUi(x) = Utai(x) + Ui-i(x); U_i(x) = 0, Uo(x) = 1, (182) 
for 1=0, 1, ---. From (12) and (1382), it follows that 
(a2 + Ne — p)Vi(p) = a2Vi+1(p) + n2¥i-1(p); 
Vi(p)=0, Wolpe) = 1. (133) 
From (13) and (133), we deduce that 
[a2(1 — d;2,) + ne — plop) = aogj-1(p) + no(1 — djx,)y+:(9), (184) 


for j < ke. Since’ 





UF et. | 2 ass (185 
ote ae @ 
it follows that 
Ne sg a2 + Ne : n2 ‘ 
von (2) oft) $2). om 
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Hence, from (13), we have a 
$;(0) = (=) . (137) 
a2 
If we set p = w, in (134), and also use (134) as it stands, we obtain 
(wr — p)>;(P) oj (@r) = Gal j-1(P) (wr) — 6)(0)dj-1(r) J 
— no(1 — dr, [6;(p) dj41(wr) — b7+1(p)b;(wr) J. (138) 


If we now multiply (138) by (n2/az)’~", and sum on /, we deduce that 
ITN? 
n 
(=) ;(p) $j (wr) 
a2 


oa A2[ bn,-1(P) dn, (Wr) _ $n,(P) bn,—1(wr) |. (139) 
But, from (21) and (22), 
[a2 Bg no(1 4 0,) =: @r |on, (Wr) = N2On,+1(Wr), (140) 


ky 
(w, — p) ») 


i=ny 


for r = 0, +--+, 2g2 + 1. Also, from (134), since q2 = 1, 
(dz + Nz — Wr) on, (Wr) = Ardn,-1(Wr) + Nebn,+1(@,). (141) 
From (140) and (141), it follows that 
Aagn,—1(Wr) = N2Orhn, (wr), r=0,--+,2qe41. (142) 
Hence, from (139) and (142), we have 
he fp p\i-™ 
(w,—-p) > (=) )(0) 6; (wr) 


JFNy 
= gn, (Wr) A2bn,-1(P) — N2d-bn,(p)], r=O0,--+,2qe2+1. (143) 


If we set p = pm in (143) and make use of the fact that pm, m = 
0, ---+, ke, are the roots of equation (15), we deduce that 


ky j7n2 
(w, _ Pm) Sn,(—Pm; a2) y (=) ; (Pm); (wr) 
jan, \@2 
= Gn, (Pm) n,(@r)[A28n,—-1(—Pm, a2) = N20;S8n,(—Pm, a2) |. (144) 


Also, if we divide (143) by (w, — p) and let p > w,, and use (142) and 
L’Hospital’s rule, we find that 


hy No JN 
> (=) [o;(wr) J 
J=Nny a2 

= on, (Wr) N20, n,(w,) = A2hn,—1(@,) J, r= 0, eae | 2q2 ote 1, (145) 
where the prime denotes derivative. 
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APPENDIX B 
Eigenvalues Corresponding to the Primary Queueing Region 


We consider here the zeros of 
H(o) = (m = 2 yt + nedn,(w), (146) 
where 
ay 
O= (“ = m)(a = 0). (147) 


As may be seen from (12) and (13), Vo,(w) and ¢,,(w) are both 
polynomials in w of degree qo. It follows that o%*' H(c) is a polynomial 
in o of degree 2q2 + 1, so that H(c) = 0 has 2q2 + 1 roots. Now’ 
in(J + 1 
ieee (148) 
sin 0 
We define 





a ) m=1,---, qa, (149) 


Om = Ag + Ne — 2Vagne cos 
m 2 2 242 ( 


and we note that 0 < a1 < --- <a,,. From (12), it follows that 
Vq,(am) = 0, m= 1, a Ry q2.- (150) 
Next, we consider 
Gn.(w) = Vo,(w) — Vq,-1(), (151) 


and we note from (137) that ¢,,(0) > 0. If g2 = 1, then ¢n,(a1) = —1, 
and the unique zero £; of ¢n,(w) satisfies 0 < Bi < a. For q2 = 2, we 
define 


{= a + ma ~ 2aamseos( =), S=1,-+-,@2-—1, (152) 
and we note that 
O<ar<$i< +++ <aq,-1 < §4,-1 < Qy,. (153) 
From (12) and (148), it follows that 
Wo,-1(fs) = 0, S=1,---,gq2-1. (154) 
Then, from (150), (151), (153), and (154), we deduce that 
dnglots) = —Yq.-r(os) <0, ($i) = Van $1) <0, 
n.(a2) = —Wq,-1(a2) > 0, ---. (155) 
We let 
on, (Bm) =0, m=l1,+++,qQ, (156) 
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with B; < B2< +--+ < B,,. Then, it follows that 


0<Bi<ai< §i1< Bo<ag< +++ < §g-1< Bg, <aq,. (157) 


Now, from (147), it is seen that w decreases from + to 0 as o 
increases from 0+ to min(1, a;/n1), and w increases from 0 to + aso 
increases from max(1, a;/7) to +. Hence ym and 62, m = 1, +++, ge, 
are determined uniquely by the relations 


(= — ms) = Ym) = Am, O< Ym < min( 1, “) 
Ym - 
ay 


= = ms) — 8m) = Bm, O0<8n< min( 1, “), (158) 
Om ie 


1 
It follows from (157) that 
0 < Yq, < 89, << +++ < yi < 61 < min(1, ai/7)). (159) 

Now, from (146) and (147), since ¢,,(0) > 0, we have H(a;/n,) > 0. 
But, from (150) and (156) to (158), 

H(61) = (» = oy, (B <0, HA (y1) = nedn,(ai1) << 0. (160) 

1 

Also, for g2 = 2, 


H(62) = (m 


ay 
82 


and so on. Finally, consideration of the sign of Vy,(w) for w ~ + 
leads to 


YB > 0, H(y2) = negn,(a2) > 0, (161) 


—00, if q2 is even, 


ee Caf -@simnedd. 16?) 


H(0+) = 
From the above results, it follows that there are roots of H(o) = 0 in 
the intervals (0, y¢,), (8g. Yao-1), °**» (52, yi) and (6), a:/m). Since 
6; < 1, there are at least g2 roots in the interval (0, 1). 
Next, from (159), we have 











a) Qa, Qa, a} Qa 
max{ 1,—]|< <—<...< < : (163) 
ny md, my 1189, MyY¢, 


Also, from (146), (147), (150), and (156) to (158), we obtain 


(2) = nl = 61) ¥o,( 8) >0, (2) = N2on,(a1) <= 0, (164) 
sea nyi1 


and, for g2 = 2, 


x( ua ) = m= 89% ,(22) <0, (.) =Nxn,(a2) >0, (165) 
N62 niy2 
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and so on. Hence there are q2 roots of H(o) = 0 in the intervals 


ay Q\ a, Q) 
D ] ae > eee 3 oe 4 
m6, my1 1189, N1Yq, 


These ge roots are greater than 1, and we have seen that at least qe of 
the remaining g2 + 1 roots lie in the interval (0, 1). It is evident that 
the 2g2 + 1 roots of H(c) = 0 are positive and distinct. 

It remains to consider the root that lies in the interval (6), a:/m). 
But, from (146) and (147), with the help of (13), we obtain 


H(1) = W,(0)(Ai — a), (166) 





where A; is as defined in (26). Also, we have shown that H(6,) < 0 and 
H(a/n) > 0. Hence, if a; < A; this root lies in the interval (6), 1), and 
there are g2 + 1 roots in the interval (0, 1), and gz roots greater than 
1. If a; > A, then a;/n; > 1 and the root lies in the interval (1, ai/nj), 
and there are q2 roots in the interval (0, 1), and gz + 1 roots greater 
than 1. If a; = Aj, then one root is unity, and there are qg2 roots in the 
interval (0, 1), and g2 roots greater than 1. 


APPENDIX C 
Results Pertaining to the Generating Function 


We derive here the solutions of (97), and we first consider J = no. 
Then, 


[a2(1 — djx,) + ne - (JGy(z) 
= a2G\")(z) + no(1 — d2,)GiP)(z), (167) 


Sk 


for no + 157 S kp. It follows from (134) that 
Gy(z) = GP (2)o(S), me SjSho, (168) 


since ¢;,(¢) = 1. If we set 7 = nz in (97), and substitute from (168), we 
obtain 


1 a 1 
{| « + m(1 = *) = font = rads) bP) = Z (169) 
With the help of (13), (95), and (133), we deduce that 


a. 4 m(1 = *) = s]ont® — Neodny+i($) 


1 
= (: Fe =) — a12)Vq,(8) + nodn,($)] = (: = =) H/2), (170) 


from (20) and (25). Hence, 
(1 — z)H(1/z)Gy2(z) = 1. (171) 
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Since T(z) = 1, from (99), we have established (100) for J = no. 
Next, we consider / = kz. Then, from (97) 


a. + n(1 - ons) = roma 


= a2(1 — 8)n,)Gy-3(z) + n2Gieh(z), (172) 


for ne =] = ko — 1. But, it may be verified from (95), (99), and (133), 
that 


Ec + mo(1 - 2) - £1 
= a2(1 — S)n,)Tj-1(z) + neVj+1(z), (173) 
for J = nz. It follows that 
Gi? (z) = G(z)T(z),  msjsk, (174) 


since T,,,(z) = 1. If we set 7 = k2 in (97), and substitute from (174), we 
obtain 


(ne — $)Ts,(2) — a2Ts,-1(2)]G&(z) = — . (175) 


With the help of (13), (95), (99), and (133), this may be written in the 
form 


(2 ‘)" (1 — 2)[(m — a1z)Vq,() + nedn, (GK (z) = 1. (176) 


It follows from (170) that 
a2 


(=) (1 — 2)H(1/z)Gx2(z) = 1. (177) 
2 


Since ¢z,({) = 1, we have established (100) for 7 = ke. 
Finally, we consider nz < 1 < kp. Then, from (97), 


a. + m1 - =) = clare 


= a2(1 — §)n,)G1(z) + neGyei(z), (178) 
for nes=j =!l-—1, and 
[a2(1 — d)2,) + me — S1G)(z) = a2G}x(z) + nell — §2,)G(z), (179) 
for /+ 1<7 S ho. It follows from (134) and (173) that 


G( )= la re (2) 0; (2), ng sje il, 


G(2)4(t), << he, a 
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since T,,,(z) = 1 and ¢,({) = 1. The consistency of the representations 
for 7 = 1 requires that 


GY (a) Tilz) = Gy (2) dil). (181) 
Also, if we set 7 = Z in (97), and substitute from (180), we obtain 
(az + nz — $)Gy (2) o($) — a2GX (2) T-1(2) 
1 
- noGy. (z)brailf) = = (182) 


It remains to solve (181) and (182) for GY (z) and GY (z). 
Now, since / < kz, it follows from (134) and (182) that 


al Gi(2)g.4(6) — GX2)T A =~ 5. (183) 

If we eliminate G\)(z) with the help of (181), we find that 
a22G'y, (z)[Tr(z)or-a(S) — e($)Tr-a(2)] = —T1(2). (184) 

But, from (134) and (173), for n2<j < ko, 

(a2 + No — S)Gj(S) = a2gy-1(8) + need), (185) 
and 

(a2 + ne — §)T;(z) = aeTj-1(z) + neTj41(2). (186) 
Hence, 


ao T(z) bj-1(8) — o (8) T)-1(2)] 
= N2[Ti+1(z) (5) — gj+i (5) T;(z)], (187) 


for n2 <j < kp. Since no < 1 < ky, it follows that 
T1(Z) gr-1($) — bi($)Tr-1(z) 
Z-ny—-1 
= (“) [Trp+1(Z) bny(S) — Griti(f)Tr,(z)]. (188) 
But, from (99) and (133), 
N2[Tryt(Z)dn(S) — dnyri($)Tn,(2)] 
1 

= a + m(2 _ :) _ ‘ont —= N2gn,+1(¢). (189) 

Hence, from (170), (188), and (189), we obtain 


a2 


i: 1\.-/1 
ae[ T(z) di-1($) — o1(§)Ti-1(2)] = (2) ( = = }a(=) (190) 


ne 
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It follows from (184) that 


I-ng 
(=) (1 - au(=)aee = T,(2). (191) 
n2 Zz 
In view of (180), (181), and (191), we have established (100) for 
na< L< ko. 

We will now establish the relationship (106). From (21) and (22), it 
follows that 


[ae + n2(1 — o,) — Wr }bn,(wr) = Non +1(Wr). (192) 
Hence, from (47), (95), and (189), since T,,,(z) = 1, we have 
Tn, +1(1/0,) dn, (wr) = Gn +1(r). (193) 


As we noted in Section IV, ¢n,(w-) 4 0. Since T,,, (1/a,) = 1, we see that 
(106) holds for both 7 = nz and j = n2 + 1. But, from (47), (95), (185), 
and (186), ¢;(w,) and T;(1/o,) are both solutions of 


(ae + Ne - w,)hy = azhj-1 + N2hj+1, ne <j < ko. (194) 
It follows that (106) holds for nz =7 = ke. 
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Memoryless Nonlinearities With Gaussian 
Inputs: Elementary Results 


By H. E. ROWE 


(Manuscript received January 12, 1982) 


The distortion-to-signal power ratio at the output of a memoryless 
nonlinearity is determined by simple calculations. This is done by 
application of Bussgang’s theorem, which may also be obtained as a 
special case of Price’s theorem. Specific results are given for hard 
and soft instantaneous and envelope limiters. 


l. INTRODUCTION 


We present elementary derivations of some useful results for mem- 
oryless nonlinearities driven by Gaussian noise. The present relations 
may be obtained as special cases of more general results,’ but the 
present methods are elementary and give physical understanding. 

Let a nonlinearity have input x, output z, and nonlinear character- 
istic h(x); 


z= h(x). | (1) 


Let x(t) be a stationary, Gaussian random noise. Then it is well-known 
(Bussgang’s theorem) that the cross-correlation between input and 
output has the same shape as the autocorrelation of the input, 


(x(t + 7)z(t)) = a- (x(t + 7)x(t)). (2) 


This may be seen as a special case of Price’s theorem,” or directly as 
outlined in the appendix. 

From (2) we can write the output z(t) of any instantaneous nonlin- 
earity with Gaussian input x(t) as 


z(t) = a-x(t) + y(Z), (3) 
where y(t) will be uncorrelated with x(t); 


(x(t + T)y(t)) = 0. (4) 
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The constant a in eq. (8) is given as follows: 


x? 


e  6OA(x) dx 


“a saa la (0) 


rye (5) 
“EG =|- ; (x)dx 
where 

(x(t)) =0, = p(t) = (x(t + 7)x(E)). (6) 


ll. NARROW-BAND INPUT 


If x(t) is narrow-band, it is useful to write 


x(t) = R(t)cos[wet + 6(t)] 
7 ; Rte Mee + ; R(t)e? eet, (7) 


where R(t) and 6(¢), the envelope and angle, slowly varying compared 
to wet, are defined in terms of x(¢) and is Hilbert transform X(t) in the 
usual way: 


R(t)e7?esc! = x(t) + jX(t). (8) 
The output of the nonlinearity may then be written‘ 


= y An(R)cos(nwet + n6), (9) 
n=0 


where FR and @ are understood to be functions of ¢. The different terms 
of (9) occupy separate narrow bands, centered around their respective 
midband frequencies nw,. The output envelopes A,(#) are nonlinear 
functions of the input envelope R.‘ 

We regard the first term of (3) as the undistorted, or signal, com- 
ponent of the nonlinearity output, and the second term of (8) as 
distortion. Thus, the signal component of (9) resides exclusively in the 

= 1 term, to which we subsequently restrict our attention. We 
redefine z(t) to be the response to (9) of a zonal filter centered on w., 
and drop the subscript 1 on the coefficient of the n = 1 term of (9), to 
yield 

2(t) = A(f)cos(wet + 8) (10) 


as the output to be investigated. In eq. (10), A(R) is given by” 
2 T 
A(R) = | h(R cos £)cos BdB, (11) 
0 


in terms of the instantaneous nonlinear characteristic h( ) of (1). 
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The output [eqs. (10) and (11)] contains both signal and distortion 
components. For Gaussian input x(t), R(t) is Rayleigh, and a of (5) is 
given in terms of envelope by 

_ (1/2)(R-A(R)) 

(0) 
R 


ts a R -— 
=——_| RA(R)-—~e dR 
26(0) [ EGO) 


ny ae a R 2 R “0 
= f ie 260) |A(R)dR, (12 
[ ete + Vit V26(0) z Wy) 











erfc u = =[ dt. (13) 


The output signal power S is given by 
S = a’-9(0). (14) 


The constant a in eq. (14) may be calculated from either (5) or (12), 
whichever is more convenient. The second line of (12) is equivalent to 
the m = 1 case of eq. (9) of Ref. 1. 

In eq. (10), z(t) contains both the signal component 


a-x(t) = a-R(t)cos[w. + A(£) | (15) 


and a distortion component, of power D. To find this, we determine 
the total power S + D in z(t) of (10), and subtract S. We have 


R? 


S+D =5 (A?(R)) = An A*(R)- ae e »OdR, (16) 
Since S <= S + D, we must have from (12), (14), and (16) 
(R-A(R))? = 26(0)-(A*(R)). (17) 
Noting that 
26(0) = (R*), (18) 


(17) is simply the Schwarz inequality. 


ill, HARD (INSTANTANEOUS OR ENVELOPE) LIMITER 
The instantaneous nonlinear characteristic of (1) is: 


1, x>0. 


are x<0. (19) 


h(x) = sgn x = 
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The corresponding envelope characteristic of (11) is: 
4 
A(R) =-, R>0. (20) 
T 


The signal gain a of (3) is, by the second relation of either (5) or (12): 


2 
«= Vie oe 


The output signal power S is by (14): 





S= Z (22) 
oi 
Finally, the total power in the first zone is given by (16) 
8 
T 
Therefore, 
D va 
=l]--. 2 
S+D : 4 24) 


IV. SOFT INSTANTANEOUS LIMITER 
The instantaneous nonlinear characteristic of (1) is: 
a |x| =X. 








A) |x| =X. 22) 
This signal gain a is from the second relation of (5): 
2 x 
a = —-erf ——, (26) 
vz V26(0) 
where 
a ar 
erf u = — e dt=1-erfcu. (27) 
Vr Jo 
V. SOFT ENVELOPE LIMITER 
The envelope nonlinear characteristic of (11) is: 
R, O= REA. 
AW) = 4 AX<R<~, ai 
The signal gain a is, by the second relation of (12): 
hes ae 2 
ie i -erfc a +1l—e 24(0), (29) 


2 2600) V26(0) 
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By (14), the output signal power is: 








da A A sie? 
S = 6(0)- IF -erf +1-—-— 0 | ! (80) 
2 ae) 


This is equivalent to eq. (24a) of Ref. 1. The total power in the first 
zone is by (16): 





A? 
$+D=9(0)-|1-¢ | (31) 
Therefore, for the soft envelope limiter (28): 
D 
S+D 








ee. A “307 /| “a0 
1 — |— erfc +1l—e 29(0) 1—e 2410) |, (82) 

| 2 J26(0)  — V2$(0) 

Asymptotically: 
A? 
D 1 -—— A? A? 

~ =e 20) eae se 33 
S+D 2° /|sa} 260) 
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APPENDIX 
Derivation of Eq. (2) 

A special case of Price’s theorem? may be stated as follows.’ Let x1 
and x2 be jointly Gaussian random variables with first and second 
moments given as follows: 


(x1) = (x2) = 0. 
(xt) = (x3) = Ru. (34) 
(x1X%2) = Rie. 
The normalized covariance is 
Rye 
=, 3 
P12 Ru ( 5) 


Let x; and x2 be passed through two different instantaneous nonlin- 
earities with characteristics h,; and hz, yielding outputs /i(x1) and 
h2(x2) with output cross-variance 


WY = (Ai(x1)he(x2)). (36) 
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Then 


Be Ge aA). (37) 
Opie 
where 
dha (x) «dhe (x) 
Kia) = 2 halen) = 2 : (38) 


To derive Bussgang’s theorem set: 





hi(x) = h(x). 
ho(x) = x. 
x, = x(t). 
Xe =x(t+7). (39) 
x(t) = stationary Gaussian noise. 
ola) = (ale + ates p(t) = 20. 
Substituting (39) into (34) to (38), 
= = aa h’(x)) = constant. (40) 
Integrating, 
W(r) = (x(t + t)A(x)) = constant-¢(7), (41) 


the result given in (2). 
The following alternative derivation, without using Price’s theorem, 
was given by Jack Salz. From (2) 


_ (x(t + 7)2(t)) 


~ (x(t + r)x(t)) 2) 


Express the nonlinear characteristic of (1) as the contour integral of 
its transform;* 


z=h(x) = x| F(ju)e?*“du. (43) 


c 


Write 


= —— e/%* (44) 


J dv 
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Then, 


Gene) Sin | F(ju) (ery dy 
27] v0 AU Cc 


= (1) | F(ju)ue™? "du, (45) 


Cc 


the same result as (41). Substituting in (42), 


Gat | F(ju)ue~*? du. (46) 
27 , 


Equation (46) is readily shown to be equivalent to (5), by substituting 
the transform relation (48) into (5) and interchanging the order of 
integration. 
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A Variation on CSMA/CD That Yields Movable 
TDM Slots in Integrated Voice/Data Local 
Networks 


By N. F. MAXEMCHUK 


(Manuscript received January 6, 1982) 


A variation on the carrier-sense, multiple-access/collision-detec- 
tion (CSMA/CD) protocol for local-area, random-access broadcast 
networks is presented. The new protocol supports any mix of voice 
and data traffic, and has an upper bound on the delay of periodic 
traffic. In conventional CcsMA/CD systems, statistical fluctuations in 
the transmission delay may cause disruptions in reconstructed voice 
waveforms. This does not occur with the new protocol. With this 
protocol, data sources use conventional csMA/ cD techniques to access 
the channel, while voice sources use a subset of these protocols, and 
appear to acquire a time-division multiplexed (TDM) slot. As in TDM 
systems, the entire system capacity can be used by the periodic 
sources. Unlike TDM systems, periodic slots are repositioned when 
other transmissions interfere with them, timing discrepancies between 
sources can be tolerated, and a centralized controller is not required 
to assign slots. Simulations are used to compare the delays in this 
system with those in conventional CSMA/CD systems. It is shown that 
at link utilizations of 0.9, voice sources can operate in a time-assign- 
ment, speech-interpolation mode. The effect of periodic traffic on 
conventional CSMA/cCD protocols is also determined. 


l. INTRODUCTION 


There are basic differences between the requirements for data and 
voice transmission. There are also differences between the capabilities 
of local networks and the functions that should be performed on these 
networks, and general global networks. A protocol is described for 
integrating data and voice in a local-area, random-access broadcast 
network. It enables a single system, with similar interfaces for voice 
and data, to be used for both. The protocol satisfies the transmission 
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requirements of both media, and takes advantage of the characteristics 
of local networks. This protocol can be applied to any system with 
periodic and aperiodic sources, as long as all of the periodic sources 
have the same transmission requirements. 

In general, transmission capacity is less expensive in a local environ- 
ment than in the global environment. This has resulted in significant 
differences in the types of networks implemented for data transmission 
in the two situations. In global networks, expensive access and switch- 
ing techniques have been used to minimize the network capacity 
required to effect communications, whereas in local networks simpler 
switching techniques and less costly access nodes have been used. 
Several global networks, which use store-and-forward and packet- 
switching techniques to reduce the transmission requirements, are 
described in Chapters 1 and 2 of Ref. 1. A survey of techniques used in 
local networks, and an extensive bibliography is given in Ref. 2. Many 
of these local networks are configured as loops or random-access 
channels. Switching is simplified, but high-bandwidth communications 
channels are required. 

The proximity of nodes in local networks leads to smaller propaga- 
tion delays than those that occur in global networks. This has resulted 
in the development of more efficient random-access broadcast tech- 
niques for local networks®* than have been developed for global 
networks (see Ref. 5 for a survey of random-access techniques and 
additional references). 

Most proposals for integrated digital voice and data networks have 
applied to global networks. A survey of this work and an extensive 
bibliography can be found in Ref. 6. These proposals use variable-rate 
speech-coding techniques,’”® packet-switching techniques,””° or com- 
plex strategies for combining circuit and packet switching.'’ These 
techniques are expensive to implement and should not be considered 
for local area communications. As in local data networks, integrated 
networks should take into account the reduced transmission costs and 
propagation delays in a local environment. In addition, only simple 
speech-coding techniques should be considered. 

The requirements for digital voice transmission, both in terms of 
capacity and delay, are significantly different from those of data. The 
distribution of data traffic on local networks is typically bimodal,” 
comprising short interactive messages and long file transfers. The 
traffic from these sources arrives sporadically. If the message is divided 
into packets, a variance in the packet delay can be tolerated, providing 
the entire message delay is not excessive. 

Uncompressed, digitized telephone calls require a large number of 
bits to be transferred, with much more stringent delay requirements 
than data sources have. Using a 32-kb/s speech coder, and transmitting 
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only during active speech intervals, over 4-4 Mb of data must be 
transmitted during a 3-minute telephone call. The data cannot be 
accumulated over the entire call and then transmitted as a large file 
transfer between computers, because the participants interact. The 
maximum delay allowed in current telephone connections is on the 
order of a few hundred milliseconds. However, in a local network, the 
maximum delay must be significantly less than this, since the connec- 
tion may also use an outside facility. 

In a packet voice system, overhead bits must be transmitted in 
addition to the information from the digitized voice source. The more 
voice samples included in this packet, the higher the ratio of infor- 
mation to total bits in the packet, and the higher the transmission 
efficiency of the channel. However, the more voice samples included 
in a packet, the greater the delay between the time a sample is 
generated and the time it is delivered to the receiving telephone. As a 
compromise, packets consisting of several tens of milliseconds of 
speech are used in the systems described in this paper. 

The variance of the delay in digital speech systems must also be 
constrained. The digital-to-analog converter at the receiver uses sam- 
ples at a fixed rate. If a packet of samples is delayed to the extent that 
the previously transmitted samples are completely used up before it 
arrives, a discontinuity occurs in the speech. The probability of this 
occurring can be reduced by delaying the first packet of voice samples 
that arrives at the receiver and buffering future packets until they are 
needed. If the maximum delay is not constrained, this technique can 
reduce, but not eliminate, the problem. This delay adds to the overall 
delay between the speaker and the listener, and must be kept small. 

In the global networks referenced earlier, packets of voice that do 
not arrive in time are lost. It is argued that if a small percentage of 
voice packets are discarded at random, the resultant distortion is 
tolerable. This same argument is used in proposals to integrate voice 
and data on local networks.'*"* It may be valid to assume that voice 
packets are lost at random when aperiodic data sources generate most 
of the network requirements, or when the periodicity of speech sources 
is reduced by variable-rate speech-coding techniques. However, this 
assumption will probably not hold in a local network. 

Based upon the measured data utilization of local networks” and 
the anticipated transmission requirements of digital voice, voice pack- 
ets will most likely dominate integrated local networks. Since voice 
packets are generated periodically, if packets from voice sources col- 
lide, they are likely to continue colliding on successive transmissions. 
Therefore, successive delays from the same source will be correlated. 
Voice sources that do not contend with other sources may have a small 
average and variance of delay, while those that contend with a large 
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number of voice sources may have a large average and variance of 
delay. If networks are designed based upon an acceptable average level 
of lost packets, and these packets are concentrated among a small 
number of connections during a small period of time, rather than being 
distributed randomly, the resultant service may not be acceptable. 

Instead of reducing the periodicities to achieve fair packet losses, 
the periodicities can be used to eliminate lost packets entirely. This is 
accomplished by a variation on the carrier-sense, multiple-access/ 
collision-detection (CSMA/CD) transmission protocol defined in Section 
II. This protocol transmits data by conventional csMA/cD techniques, 
but uses only a subset of these techniques to transmit voice. The 
periodic sources do not detect collisions. In addition, periodic packets 
are tailored as described in Section III, the length of data packets is 
constrained, and periodic packets are given a higher retransmission 
priority. In Section IV, we show that this protocol limits the delay of 
voice packets to one data packet transmission time. 

Periodic sources using this protocol operate as if a time-division 
multiplexed (TpM) channel has been assigned to each source. The 
difference between this channel and standard TpM channels is that it 
is not locked solidly into a time slot. The time slot may be shifted 
slightly back in time. When this occurs, the voice samples that arrive 
during the shift are transmitted in an expanded information area. An 
interesting characteristic of this system is that a periodic source can 
gain access to a system that does not appear to have the capacity to 
handle another periodic source. The system does not fail, but begins 
to operate as a completely utilized TpM system, with a slightly longer 
slot period. This phenomenon is explained in Section V. Another result 
of the time slot mobility is that timing discrepancies can exist between 
periodic sources without time slots being overwritten. This type of 
operation is described in Section VI. 

In this system, if a periodic source and an aperiodic source are 
waiting to use a busy channel, the periodic source has a higher access 
priority and acquires the channel first. Typically, in queueing systems, 
when the delay of one class of users is reduced by raising its priority, 
the delay of the remaining users must increase. A simulator, which is 
described in Section VII, has been written to study the delays in this 
type of system. The model for the sources in the simulations is 
explained in Section VIII. The delays of periodic and aperiodic sources 
in this system are compared with the delays incurred when all sources 
use the same access protocol. These results are reported in Section IX. 
In Section X, alternative protocols are described. 


li. TRANSMISSION PROTOCOL 


In this protocol, all of the packets from an aperiodic source, and the 
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first packet from a periodic source that is beginning to transmit, use a 
csMA/CD protocol. Before transmitting, the source listens to the chan- 
nel and refrains from transmitting if the channel is busy. While 
transmitting, the source also listens to the channel and stops trans- 
mitting if a collision with another source is detected. If the channel is 
busy, or if a collision occurs, the source tries again after the channel 
becomes idle. This protocol is not dependent upon the specific retry 
strategy these sources use to resolve contention. Any of the conven- 
tional strategies will work. 

After the first transmission, a periodic source transmits all of the 
data it has accumulated whenever it acquires the channel. The source 
schedules its next transmission attempt at a fixed time T, after the 
last successful transmission. In the first description of the transmission 
protocol, it will be assumed that T, is the same for all sources. The 
effect of timing errors will be investigated in Section VI. 

A periodic source listens before transmitting and defers transmission 
rights to terminals that are currently transmitting. However, this 
source does not listen while transmitting, and never terminates trans- 
mission prematurely. Instead, the packet structure for a periodic source 
is designed to allow aperiodic sources to detect a collision and termi- 
nate transmission before the periodic source begins transmitting useful 
data. When busy channels are encountered, the periodic source begins 
transmitting as soon as the channel becomes idle. This protocol, in 
conjunction with a constraint on the packet size from aperiodic sources, 
prevents packets from periodic sources from colliding. 

As periodic traffic enters a system, the system becomes a TDM 
system. A periodic source acquires the channel and periodically uses 
a slot starting at this time until it is delayed by aperiodic traffic, or 
another periodic source starting to transmit. At this time, the periodic 
slot scheduled for the periodic source is shifted slightly. Additional 
data are transmitted in the first delayed slot to compensate for the 
shift. 


I. PACKET STRUCTURE 


The packets from aperiodic sources, Fig. 1a, consist of overhead bits 
and data bits. The overhead bits contain synchronization bits, source 
and destination addresses, packet-length counts, packet sequence num- 
bers, error-control bits, and any other functions required by the com- 
munications protocols. These are variable-length packets. They are 
constrained to be less than or equal in length to the packets from the 
periodic sources. 

Packets from periodic sources, Fig. 1b, consist of preempt bits, 
overhead bits, data bits, and overflow bits. During the preempt inter- 
val, the periodic source places a signal on the transmission media but 
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OVERHEAD DATA 


(H, BITS) (1, BITS) 
(a) 
PREEMPT OVERHEAD DATA OVERFLOW 
(Pp BITS) (Hp BITS) Up BITS) (Op BITS) 
(b) 


Fig. 1—(a) Packets from aperiodic sources. (b) Packets from periodic sources. 


does not send information. This interval is long enough for an aperiodic 
source to detect a collision, stop transmitting, and have the effects of 
the transmission removed from the system before the periodic source 
begins transmitting useful data. 

The length of the preempt interval, 7,, is 


Tp = 27: + Ton + T1+ Tof, 


where 

7: is the maximum one-way propagation delay in the medium, 

Ton iS the time required for a signal to be detected once it has 

propagated to a position on the channel, 

tT; is the time before the hardware reacts to a collision, 
and 

Toff is time for a signal that is turned off to stop affecting a receiver, 

after the end of signal has propagated to the receiver. 

Times 7, and 7,4 take into account electronics delays in the receivers 
and distortion in the received waveform caused by finite bandwidth 
channels which may not be correctly terminated. The preempt interval 
in a 3-Mb, 1-km simulated system is 38 bits long. 

The overhead segment for periodic packets will be smaller than the 
overhead segment for aperiodic packets. An error-control strategy may 
not be used for periodic sources, since retransmitted packets will 
probably arrive too late to be used, and a greater error rate can be 
tolerated in sampled voice than in data. Packets will not arrive out of 
sequence; therefore, sequence numbers are not necessary. In addition, 
the number of bits of data in packets from these sources is determin- 
istic, so that the length field is not required. In the simulations, Hs = 
100 bits and H, = 40 bits. 

When a periodic source acquires the channel, it transmits all the 
data it has accumulated. The source schedules its next transmission 
T, seconds after a successful transmission. If the channel is not busy 
at this time, the samples it has collected fit in the data area. If the 
channel is busy, it waits before transmitting. The samples that arrive 
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during this time are transmitted in the overflow area. Even when there 
are no overflow samples to be transmitted, the source transmits carrier 
during the overflow time. This guarantees that a periodic source takes 
no more time to transmit when it is delayed than when it acquires the 
channel immediately. 

The size of the overflow area is determined by the maximum delay 
a periodic source can experience. This transmission protocol guaran- 
tees that the maximum delay for a periodic source will not exceed a 
packet transmission time. In the system simulated, the periodic sources 
generate 8000 4-bit samples per second, and T, is 30 ms. A maximum 
of four samples arrive during a packet transmission interval; therefore, 
the overflow area is 16 bits. 

The first packet from a periodic source may be Sten than succes- 
sive packets since it does not include a preempt interval or an overhead 
interval. However, the same packet size will be maintained for this 
packet as for all other periodic packets. This guarantees that the 
scheduled interval between the next packet from this source and a 
packet from another periodic source is at least a periodic packet 
transmission time, Xp. 


IV. CHANNEL CONTENTION 


Delay is incurred in CSMA/cD networks when the channel is busy 
and when transmitting sources collide. Systems with periodic and 
aperiodic sources are being considered. Therefore, a periodic source 
can be delayed because 

(¢) the channel is busy transmitting an aperiodic packet, 
(tz) a collision with an aperiodic source occurs, 

(iz) the channel is busy transmitting a periodic packet, or 

(tv) a collision with a periodic source occurs. 

In this section, it is shown that only the first and third of these four 
mechanisms delay periodic sources, and that the maximum delay is 
less than X,. It is assumed that every periodic source has the same T>. 
The effects of timing inaccuracies are investigated in Section VI. 

Whenever a periodic source and an aperiodic source collide, the 
aperiodic source detects the collision and stops transmitting before the 
periodic source begins transmitting useful data. Therefore, a periodic 
source is not delayed by a collision with an aperiodic source. 

Whenever a periodic source and an aperiodic source are waiting for 
an idle channel, the periodic source obtains the channel. The periodic 
source begins transmitting as soon as the channel becomes idle. If an 
aperiodic source waits, it detects a busy channel and does not transmit. 
If an aperiodic source begins transmitting immediately, it detects a 
collision and stops transmitting. Therefore, a periodic source can only 
be delayed by an aperiodic source whose transmission is already in 
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progress. This delay is at most the packet transmission time for an 
aperiodic source, Xa, which is constrained to be less than X;,. 

Consider a sequence of & periodic sources scheduled to transmit at 
times 


ti, fo2, -++ , tap. 


Let 
t; ; be the time the transmission from source 1 is scheduled to appear 
at the location of source 7 (¢;,; — ti; is the propagation delay from the 
ith to the jth source), 
ti; be the time the transmission from source 1 actually appears at 
location 7 (this takes into account the channel acquisition delay for 
the zth source), 
and, 
D; = ti; — ti; be the delay of the ith source. (Since the propagation 
time is independent of the delay, the delay between the scheduled 
and actual arrival is the same for each destination /, and the second 
subscript is not needed.) 
The transmission from each periodic source lasts X,, and each 
periodic source schedules its next transmission T, after its last suc- 
cessful transmission. Therefore, 


tities — fier = Xp: 


If D; = 0, the zth periodic source does not delay the (z + 1)th periodic 
source, and, as long as 


D; < Xp 


these two sources will not collide. 

Periodic sources cannot be delayed by periodic sources which have 
not been delayed. Therefore, the first periodic source to be delayed 
must be delayed by an aperiodic source. The delay incurred by the 
first periodic source is less than X,. The effect of this delay may 
propagate and effect a sequence of periodic sources. In a general 
sequence of periodic sources, if 


D,< Xp, 
then, 
bina < bist; 


and, the ith and (zt + 1)th do not collide. The transmission time 
required by the ith source is X,, even though it is delayed, and must 
transmit more samples. If 


tA 
tiiti + Xp S litaisi, 
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then the (z + 1)th is not delayed by the ith periodic source. This source 
may be delayed by an aperiodic source, and start a new sequence of 
delayed periodic sources, but the delay it incurs will be less than X,. If 


tiiet + D Oe raere es 
the delay incurred by the (z + 1)th source is 
Di+1 => bien i Xe pe C++. 


Since the (i + 1)th source is waiting for the channel, this delay cannot 
be increased by an aperiodic source. The delay can be written as 


Diay = Di + tiina — bisaier + Xp. 
Since 
tiie — tistiei1 + Xp = O, 
Dizi = Di. 


Therefore, the delay incurred by a sequence of periodic sources is a 
nonincreasing function, the maximum delay incurred by a periodic 
source is less than X,, and periodic sources do not collide. 


V. OVERLOAD TRAFFIC 


Consider a system operating in a mode in which the channel capacity 
is nearly completely used by periodic sources. Assume that a time gap 
remains that is large enough for another source to begin transmitting, 
but not large enough to transmit an entire packet. Let another periodic 
source acquire the channel at this time. Is a failure mode created? Do 
delays build up until periodic sources start colliding? Is data lost? No. 
Instead, the system begins to operate without time gaps. The period 
between channel acquisitions increases, and some or all of the overflow 
bits in every packet are always used. Whenever a periodic source can 
acquire the channel, its ability to transmit is guaranteed. 

Let a periodic source begin transmitting in a small time gap, such as 
that described in the hypothetical example. The periodic source it 
delays is delayed less than X,. This source transmits the data accu- 
mulated during the delay in its overflow area and schedules its next 
transmission T, seconds after it successfully acquires the channel. 
Successive periodic sources are delayed by an amount less than or 
equal to the delay incurred by the preceding source, as shown in the 
previous section. 

The original interfering source becomes just another source in the 
sequence of interfering sources. It can be delayed by no more than the 
delay it originally caused, and can delay the source following it by no 
more than it did originally. Since the delay is a nonincreasing function, 
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and it cannot go to zero for the over-utilized channel, it must stabilize 
at some positive time, €, which is the same for all sources. The delay, 
€, is equal to X,, minus the sum of the idle channel times for a period 
T, before the overflow source entered the channel. Using the termi- 
nology from the preceding section: 
€= Xp = > [tisaitt _ (tii+1 Tt Xp) |. 
tii+1€Tp 
When the stable situation occurs, each periodic source transmits a 
packet every T, + € seconds. It transmits the samples that have arrived 
in this period of time in the data and overflow area. At the end of each 
transmission, there is a periodic source that has been waiting € seconds. 
This source acquires the channel before an aperiodic source or the first 
packet from another periodic source can. Until one of the sources ends 
its transmission, and channel capacity becomes available, the system 
operates as a TDM system with a slot period of J, + € seconds. No data 
are lost, and the slot delays do not grow indefinitely. 


Vi. TIMING CONSIDERATIONS 


In a sampled data communication system, it is necessary for the 
transmitter and receiver to be frequency locked, so that samples are 
used at the same rate at which they are generated. In broadcast 
networks, this can be achieved by sending a clock signal outside of the 
data band or by using a modulation rule with a clock component. The 
former technique provides accurate timing, but requires that one unit 
be responsible for inserting the clock on the system. In the latter 
technique, there is no centralized control, every transmitting unit is 
identical, but timing discrepancies may exist between the transmitters, 
particularly when very little data are being transmitted. 

Timing discrepancies result in the periodic terminals having differ- 
ent estimates of the interpacket interval 7. Let the interpacket 
interval for the ith periodic terminal 7;,, be within ¢€ of T,,, so that 


|Tip — Tp| Ss. 
Let the zth and (i + 1)th periodic terminals transmit at 
Citi = 0, 
and, 
tisii+1 = t + Xp, 


so that there is no separation of the packets at the (7 + 1)th source. 
The next packets from these terminals are scheduled at 


tina = t+ Tip, 
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and, 
Ci+1i+1 =t+ Xp + T isin 


These two times may be separated by as little as X, — 2e. If the first 
packet in this sequence is delayed by a packet from an aperiodic 
source, it may be delayed until 


th ita < t + Digs + on 


With the constraint X, = Xp, it is possible that both periodic sources 
will be waiting for the channel and will collide. This situation cannot 
be resolved by the protocol. It can be prevented by constraining the 
length of aperiodic packets to 


Xa = Xp — 2e. 


With this constraint, it can be shown, as in Section IV, that sequences 
of periodic sources do not collide, and that the delay of a periodic 
source does not exceed X;. 

This length constraint cannot be used to solve all of the problems 
that arise because of timing inconsistencies. The first packet from a 
periodic source must have length X, to reserve an entire slot, but must 
enter the system like a periodic packet. This can be resolved by 
allowing periodic sources to recognize and preempt the first packet 
from a periodic source that begins transmitting within 2e of the periodic 
source’s scheduled transmission. Now, the maximum delay caused by 
the first packet from a periodic source is not greater than that caused 
by a packet from an aperiodic source. 


Vil. SIMULATION 


A general-purpose simulator has been written to determine the 
characteristics of broadcast networks. In this simulator, the protocols 
used by sources, the traffic generated by a source, the physical position 
of a source on a channel, the parameters that define a transmission 
channel, and the interconnection of channels can be varied. 

In the simulator, time jumps between the occurrence of significant 
events, rather than continuously moving forward by small increments. 
Functions such as wait for idle channel and listen while transmitting 
are implemented by giving the channel the characteristics of an active 
device. The channel notifies a source when the state of the channel, at 
the location of the source, has changed. Actually, the channel in this 
type of system is passive and sources that perform these functions 
examine the channel continuously. However, the interval of time in 
which the state of the channel can change and a significant event can 
occur is small compared to the average time between significant events. 
A simulator that models the continuous operation of the sources would 
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have to examine the channel at small intervals, and would be much 
less efficient, in terms of processing, than the technique selected. 

In this simulator, the channel maintains lists of sources that are 
transmitting, listening while transmitting, and listening for an idle 
channel. When the state of the channel changes, channel maintenance 
routines notify the terminals in the lists at the appropriate times. 
These routines take into account the propagation delays between 
terminals, and the detection time of terminals. 

Queues of messages and packets waiting to be transmitted from the 
sources are maintained without creating physical queues. Instead, the 
clocks associated with packet and message generation are allowed to 
run independent of the simulation clock. Each source keeps track of 
the time the packet and message it is transmitting were generated. 
When the transmission is complete, this time is updated to determine 
the next packet or message arrival at this source. If this time is less 
than the simulation time, this object has been waiting in a queue, and 
can be transmitted immediately. If it is greater than the simulation 
time, the queue is empty, and the message or packet is scheduled to 
arrive in the future. 

The periodic terminals approximate voice terminals that only trans- 
mit during active speech intervals. For the parameters selected in the 
simulations, the speech interval durations are more than three orders 
of magnitude greater than the packet transmission time, and more 
than six orders of magnitude greater than certain of the channel- 
related events. The simulator tracks the fine structure of channel- 
related events. Because of the different time scales, the simulations 
could not be conducted for a long enough time to guarantee that all 
combinations of active and inactive intervals occur with the proper 
probabilities. To compensate for this, a different random-number 
generator is used to generate active and silent intervals than is used to 
generate the other random events in the simulation. This allows 
different protocols for periodic sources to be compared when the same 
active and silent intervals occur. 

The systems simulated have the following characteristics: 

(t) Sources are uniformly distributed along a single cable. 
(zz) Channel length is 1 km. 

(tit) Transmission rate is 3 Mb/second. 

(tv) Time to detect the presence and removal of carrier, Ton and To, 
is 1 ps. 

(v) Time to detect interference, 7, is 4 ps. 

(vit) Time the channel must remain idle between transmissions is 
one-third of a microsecond. 

These parameters do not correspond to a particular hardware real- 
ization of a system, but are indicative of what might be expected. 
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VII. SOURCE MODELS 


Aperiodic and periodic terminals are simulated. They correspond to 
a number of data sources sharing a store and forward node, and a two- 
way voice conversation. Some of the parameters associated with these 
terminals are selected to study specific characteristics of the system or 
to facilitate programming, rather than to accurately model the source. 
The results of the simulations provide an indication of how the system 
operates and not precisely how many terminals of a specific type can 
be supported. 

The model for the sources in this system is shown in Fig. 2. In the 
remainder of this paper, the packets from the various sources are 
referred to as follows: 

Type A—all of the packets from aperiodic sources. 

Type F—the first packets in active intervals from periodic sources. 

Type P—all other packets from periodic sources. 

Type X—a composite of all of the packets from periodic sources. 

Type T—a composite of all of the packets from all sources. 

The protocols for periodic sources are referred to as follows: 
II,—Periodic sources use the protocol proposed in this paper. 
IIz—Periodic sources use the same protocol as data traffic and 

transmit enough overhead bits to have the same packet size as I. 
IIs—Periodic sources use the same protocol as data traffic, but 

transmit fewer overhead bits than ITz. 

IIx—Either II, or IIs. This symbol is used when describing the 
intersection set of these two protocols. 

The doublet (x, y) refers to packets of type x, when the periodic 

sources use protocol y. 

In the model, all of the sources test the state of the transmission 
medium before transmitting (LBT). If the channel is currently being 
used (B), they wait until it is idle (wic), then schedule a transmission 
retry according to a source-specific strategy (SRB). Transmission prior- 
ities can be established by using different strategies for different 
sources. For instance, if one type of source can retry sooner than 
another, these sources are more likely to find the channel idle and will 
have a smaller average delay. For this set of simulations, all of the 
sources begin transmitting as soon as the channel becomes idle. If the 
channel is not heavily used, there is seldom more than one source 
waiting, and this will lead to the smallest possible delay. If more than 
one source is waiting, they collide and the collision retry strategy 
determines which source obtains the channel. 

Transmitting sources may or may not listen while transmitting. 
In this set of simulations, sources listen while transmitting type A, 
(F, IIa), and (X, IIx) packets. When a collision is detected, a source- 
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TEST-TAKES NO TIME. 


— EVENT WHICH MAY TAKE NONZERO TIME. 
C) CONDUCTS A TEST WHILE AN EVENT IS IN PROGRESS. 
LBT — LISTEN TO THE CHANNEL BEFORE TRANSMITTING. 
LWT — LISTEN TO THE CHANNEL WHILE TRANSMITTING. 
XMIT — TRANSMIT. 
WIC — WAIT FOR IDLE CHANNEL. 
SRB — SCHEDULE A RETRY AFTER A BUSY CHANNEL. 
SRC - SCHEDULE A RETRY AFTER A COLLISION. 
SNP — SCHEDULE TRANSMISSION OF NEXT PACKET. 
SNM — SCHEDULE TRANSMISSION OF NEXT MESSAGE. 
— CHANNEL BUSY. 
— CHANNEL NOT BUSY. 
COLLISION DURING TRANSMISSION. 
~ NO COLLISION DURING TRANSMISSION. 
LP — TEST IF THE TRANSMITTED PACKET WAS THE LAST PACKET IN THE MESSAGE. 
LM — TEST IF THE TRANSMITTED MESSAGE WAS THE LAST MESSAGE IN THE QUEUE. 
WNM — WAIT FOR NEXT MESSAGE. 
TLWT- TEST IF THIS TERMINAL 1S TO LISTEN WHILE TALKING. 


ao wa 
| 


Fig. 2—Model of the transmission strategies used by the sources in the simulations. 


dependent retry strategy (SRC) is initiated. The sources do not listen 
while transmitting type (P, IL) packets. 

The algorithm, src, provides another opportunity to assign trans- 
mission priorities. Sources generating type A and (X, IIx) packets 
distribute their retrys uniformly over an interval 2'Ar, where i is the 
number of collisions this packet has experienced, and Ar is two round- 
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trip transmissions delays. The value of 2 is limited to ten, so the retry 
interval varies from two to 1024 round-trip delays. Sources generating 
type (F, IIs) packets distribute their retrys uniformly over an interval 
Ar. Under heavy-load conditions, these souces retry sooner than those 
generating type A packets and have a smaller average delay. The 
periodic sources model voice terminals that only transmit during active 
speech intervals. Type (F, IIs) packets are given a higher priority 
because they must acquire the channel in less than 50 ms more than 
98 percent of the time.’*’* It is shown in Section IX that this require- 
ment is met at channel utilizations of 0.9. 

In this model, after a packet is successfully transmitted the source 
determines if this is the last packet in a message. If it is not, the next 
packet transmission is scheduled. If it is, the next message is scheduled. 
For the periodic sources, a message corresponds to an active interval 
during which packets are generated. After the last packet in an active 
interval is transmitted, there is never a message waiting. Instead, a 
silent interval is generated, and lasts until the next message arrives. 
Type (P, IIx) packets are generated every T, seconds after the first 
packet successfully acquires the channel. These packets are queued 
until they can be transmitted. Type (P, IT4) packets are generated T, 
seconds after the last successful channel acquisition. 

The aperiodic sources generate variable-length messages. The mes- 
sages arrive independently and are queued until they can be transmit- 
ted. The message length determines the number of packets in the 
message and the length of the last packet. The source transmits all of 
the packets in a message before determining if another message is 
waiting. The packet scheduling algorithm (SNP) waits before transmit- 
ting successive packets from this source. This enables other sources to 
acquire the channel. 

The aperiodic sources in these simulations have the following char- 
acteristics: 

(t) The message interarrival process is a negative exponential 
distribution with a mean of 33.47 ms. 

(wz) The packet size is 1050 bits, 90 overhead bits, and 960 data 
bits. 

(uz) The message length is deterministic—it is 960 bits. (Each 
message generates a single maximum-size packet.) 

(tv) The minimum time between successive packets from the same 
source is four round-trip transmission delays. 

These characteristics do not correspond to any known data source. 
They are selected to allow comparisons between systems with different 
mixes of periodic and aperiodic sources. The average number of bits 
transmitted by one of these sources is equal to the average number of 
bits transmitted by a periodic source. This allows the network utiliza- 
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tion to remain constant, while the mixture of periodic and aperiodic 
sources is varied. From this, the effect of periodic rather than aperiodic 
requirements on a broadcast network is determined. Using these 
characteristics, type A packets are almost the same size as type 
(X, II4) packets. This causes the largest possible delay to be incurred 
by the periodic sources. 

The periodic sources have the following characteristics: 

(t) For all of the sources, T, is set to 30 ms. This results in 960 
data bits per packet. 

(it) The duration of silent and active intervals are exponentially 
distributed with means of 0.185 and 1.31 seconds, respectively. 

(iit) For type (X, Ila) packets, H, = 40 bits, O, = 16 bits, and 
P, = 38 bits. This results in 1054-bit packets. 

(tv) For type (X, IIz) packets, H, = 94 bits. This retains the same 
link utilization as aperiodic sources and periodic sources using the new 
protocol. 

(v) For protocol I1,, H, = 40 bits. This shows the effect of increas- 
ing the link utilization in the new protocol. 

The mean values of the distributions are selected so that the total 
active time, the average active interval, and the number of active 
intervals correspond to those in a two-way telephone conversation.” 
This model assumes that the two speakers in a conversation do not 
talk simultaneously. It ignores the 0.07 probability of double talk. This 
model is easier to implement than one which allows double talk and 
should provide a reasonable indication of the system delays. 


IX. SIMULATION RESULTS 


Simulations have been conducted for protocols II,, II,, and Iz. The 
fraction of the requirements generated by periodic sources was set to 
0, 10, 25, 50, 75, 90, and 100 percent of the total requirements and the 
nominal offered utilization, S, to 0.7, 0.8, and 0.9. For protocols II, and 
IIz, the actual values of S are very close to the nominal values. For 
protocol II,, the utilization generated by aperiodic sources and the 
number of periodic packets is the same as for the other protocols. 
However, the overhead associated with periodic packets is reduced. 
Therefore, the actual value of S is less than the nominal value. 

Statistics were taken for 10 seconds of elapsed time after a 2-second 
initialization period. As the utilization varied from 0.7 to 0.9, the total 
number of packets in the simulations varied from about 20,000 to 
25,500. The time between successive starts of active intervals for the 
periodic traffic is on the order of a second or two. It is unlikely that all 
combinations of active and inactive periodic terminals are witnessed 
in a 10-second interval. However, the same active intervals occurred 
for each of the protocols, and the data show definite trends. The 
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average and standard deviation of the channel access delay are plotted 
as a function of the percentage of the load generated by periodic 
sources in Figs. 3 and 4. 

For Iz, the periodic and aperiodic sources use the same transmission 
protocol and have the same packet lengths. For this protocol, the 
network utilization remains constant as the load shifts from aperiodic 
to periodic sources. All of the curves (T, IIz) display the same char- 
acteristic. For a small fraction of periodic load, the delay measures 
increase. As the fraction of the load generated by periodic sources 
becomes large, the delay measures decrease. 

This behavior can be accounted for by characteristics of periodic 
sources. As the periodic sources change between the active and inactive 
mode, the short-term utilization changes. The delay versus utilization 
curve for this type of system is concave upward so that the delay 
averaged over a number of utilizations is greater than the delay 
associated with the average utilization. This accounts for the initial 
increase in the curves. The periodic sources transmit packets at a 
multiple of T,, after the first successful transmission. This separates 
sources so that a large number of sources do not collide. Since it takes 
longer to resolve contention between larger numbers of sources, the 
average delay decreases as the number of sources that may be sepa- 
rated increases. This accounts for the decrease in the delay measures 
when the fraction of the load generated by periodic sources exceeds 
ten percent. 

Curves (7, II4) show the delay characteristics for all of the packets 
in a system that uses the new protocol. The total utilization is 
the same as for IIz. The delays for (7, IIs) are less than those for 
(T, IIz). Protocol II, guarantees that periodic sources do not collide 
with each other and that they successfully acquire the channel when 
they collide with aperiodic sources. Since there are fewer collisions, 
and fewer of the collisions that occur result in unsuccessful transmis- 
sions, less of the channel capacity is wasted. The decrease in delay is 
a result of the reduced-channel contention. 

Comparing II, and II; demonstrates the favorable characteristics of 
the new protocol but not the unfavorable characteristics. Protocol Il 
requires additional bits to be transmitted in the preempt and overflow 
areas of the message. To show the effect of these bits, II, is examined. 
This protocol is the same as II;, except the overhead in periodic 
packets is reduced. As the fraction of the load generated by periodic 
sources increases, the average channel utilization decreases. Curves 
(T, II,) are below (T, II,), but are still above (T, II,). The latter result 
is due to the ratio of data bits to packet length in the system simulated. 
The preempt header is dependent upon the propagation delay in the 
system. As the channel becomes longer, or the transmission rate 
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Fig. 3—Average delay versus the fraction of the load generated by periodic sources for several types of sources with protocols at the following 
utilizations: (a) p = 0.9; (b) p = 0.8; (c) p = 0.7. 
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Fig. 4—Standard deviation of delay versus the fraction of the load generated by periodic sources for several types of sources with protocols at the 
following utilizations: (a) p = 0.9; (b) p = 0.8; (c) p = 0.7. 


increases, this will comprise a larger fraction of the packet in II, and 
the operation of II, will improve with respect to II4. A similar improve- 
ment will occur if T, is decreased. 

Protocol II, establishes an upper bound on the delay experienced 
by periodic traffic by giving this traffic a higher priority than aperiodic 
traffic. The dashed curves (A, II4) and (P, IIs) represent the compo- 
nents of (7, II4) generated by aperiodic and periodic traffic, respec- 
tively. As expected, curves (A, II,4) are above (T, IIs), while (P, Ia) 
are below. However, there are some interesting effects noted on these 
curves. As the fraction of the load generated by periodic sources 
increases, the number of sources that have a higher priority than the 
aperiodic sources increases. The delay experienced by the aperiodic 
sources is expected to increase. However, at utilizations of 0.8 or 0.9, 
there is not a significant increase in the average delay, and the standard 
deviation of the delay decreases. 

From curves (7, IIz), it can be concluded that it is definitely 
beneficial for aperiodic sources to share a channel with periodic 
sources. From curves (A, II,), it may even be beneficial for aperiodic 
sources to share a channel with higher priority periodic sources. 

It has been stated that speech-activated systems should have a 
channel-acquisition delay less than 50 ms at least 98-percent of the 
time. From curves (A, II), it should be possible to meet this constraint 
up to utilizations of 0.9, even if the first packet of speech is given the 
same transmission priority as aperiodic traffic, assuming that the 98- 
percent point corresponds to two standard deviations beyond the 
mean. In II, the first packet from a periodic source is given a higher 
priority than packets from aperiodic sources. Using this strategy, it 
should be possible to lose no samples, not even those used to detect 
the presence of energy. For instance, if the transmitter stores the last 
30 ms of speech samples, and it takes 10 ms to detect the presence of 
energy, no active samples are lost when the channel is acquired within 
20 ms. At utilizations of 0.9, this appears to be well within the capability 
of the system. 


X. ALTERNATIVE PROTOCOLS 


Several other protocols for periodic traffic were considered. These 
protocols did not operate as well as the protocol described here and 
were not completely characterized. However, it is instructive to de- 
scribe two of these protocols and their perceived shortcoming. 

One possible approach to reduce the average and variance of the 
delay for periodic traffic is to give the periodic traffic a higher retrans- 
mission priority than aperiodic traffic. This approach is used in II, to 
reduce the delay for the first packet in a periodic sequence. Note in 
Figs. 3 and 4 that this strategy works best when the high-priority 
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traffic is a very small fraction of the total traffic. Priority systems do 
not reduce contention between high-priority users. In a local network 
designed to handle data and voice requirements, the voice transmission 
requirements are expected to far exceed the data requirements. A 
priority system will not provide significant improvements in this type 
of an environment. 

In addition, the priority system may reduce delay problems, but it 
does not eliminate them. The delay is still an unbounded statistical 
variable. The probability of exceeding a specified delay is nonzero, and 
packets will be lost. This probability may be acceptable for moderate 
utilizations, but as the utilization approaches one, it will increase and 
become significant. Because of the high transmission requirements of 
voice, it is expected that a voice and data system will have to operate 
at much higher utilizations than data only systems. 

Another protocol, which was tried, gives periodic traffic absolute 
preemptive right to the network. The periodic packets have a preempt 
header. They begin transmitting without listening to the network and 
interrupt any terminals that are transmitting. The preempt header is 
long enough for the nonpreemptive terminals to detect a collision and 
stop transmitting before useful data are transmitted by the preemptive 
terminals. This protocol results in zero transmission delay for periodic 
packets once they have acquired a slot. They acquire the slot for the 
first packet by behaving like an aperiodic terminal, as in the recom- 
mended protocol. 

This protocol requires exact timing synchronization between pe- 
riodic terminals and does not operate well at high-channel utilizations. 
Since the periodic terminals do not listen to the channel before 
transmitting and transmit periodically at what they perceive to be T, 
seconds, if there is a discrepancy between various terminals’ estimates 
of 7,, they will eventually interfere with one another. In addition, 
when the utilization for periodic sources is slightly greater than 50 
percent, it is possible for these sources to be spaced in such a way that 
no new sources can obtain a time slot. 

When this protocol was simulated, it was found that when the 
channel utilization from periodic sources exceeded 0.7, the channel 
efficiency decreased rapidly and the access time for the first packets 
from periodic sources and packets from aperiodic sources became 
intolerably long. The channel efficiency is the fraction of time the 
channel is being used to transmit data that are successfully received, 
over the total time that data are being transmitted. The unsuccessfully 
transmitted data occurs when packets are interrupted by collisions. 
The propagation delay in local networks is small compared with the 
packet transmission time, and terminals listen before transmitting. 
Therefore, for the previously defined protocols, collisions only occur 
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near the beginning of transmission. In this protocol, a packet can be 
interrupted at any time. As the fraction of time that preempt terminals 
use the system increases, the probability of interrupting a terminal 
that has completed a significant portion of its transmission increases 
and the channel efficiency decreases. As the channel efficiency de- 
creases, the fraction of the channel capacity remaining for successful 
transmissions decreases and the delays increase. When the channel 
utilizations generated by periodic sources approached 0.7, the delay 
experienced by first periodic transmissions and aperiodic transmissions 
ranged from several tenths of a second to a second. This is an 
intolerable delay. 


XI. CONCLUSIONS 


When periodic and aperiodic data are transmitted on the same 
network, using conventional CSMA/CcD protocols, the average and var- 
iance of the delay can be reduced by the periodicities in the data. By 
requiring periodic sources to begin generating periodic packets after 
they have successfully acquired the channel, the periodic requirements 
are separated in time, and collisions are reduced. This results in a 
decrease in the network delay. Channel contention is further reduced 
by requiring periodic sources to transmit all of the data they have 
acquired whenever they transmit and to schedule their next transmis- 
sion a fixed time after their last successful channel acquisition. 

If, in addition, periodic sources are only delayed by aperiodic sources 
with transmissions in progress, the size of packets from aperiodic 
sources is less than that from periodic sources, and the size of packets 
from periodic sources is not increased by the delay, then packets from 
periodic sources do not collide. The maximum channel acquisition 
delay experienced by periodic sources in this mode of operation is a 
packet transmission time. 

By delaying the use of the first packet from a periodic source by a 
packet transmission time, the next packet is always received before all 
of the samples in previous packets are used. There are no discontinu- 
ities caused by the late arrival of samples in the reconstructed wave- 
form, and packets of samples are never discarded because of excessive 
transmission delays. 

In this mode of operation, periodic sources appear to have a dedi- 
cated TpM channel. There is a difference between the bandwidth 
allocation in this type of system and that of a true TDM system. In a 
TDM system, the time slots for a particular channel always carry the 
same number of bits and are fixed with respect to the beginning of a 
periodically recurring frame. In this system, the time slot for a partic- 
ular channel may be moved back with respect to a hypothetical start 
of frame when the channel is busy. When this occurs, the number of 
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bits of data being carried by the slot increases. In the extreme, when 
the channel is overutilized, the slot is shifted back by the same amount 
every time it occurs. The slot period and the amount of data in the 
slot increases. 

The mobility of time slots, in conjunction with the access protocol, 
allows all of the excess capacity that is not being used to transmit 
periodic packets to be used by aperiodic sources, even if the excess 
capacity is dispersed in small intervals throughout the TpM frame. The 
system can shift between any mix of periodic and aperiodic traffic 
without a central controller reallocating the bandwidth. 

In conventional TDM systems, all of the transmitters must be able to 
accurately determine the location of their assigned slots to avoid 
transmitting simultaneously with other sources. In this system, a 
periodic source is willing to wait for a communication in progress to be 
completed. Therefore, timing inaccuracies between periodic sources 
can be tolerated, as long as the inaccuracy is not long enough to cause 
two periodic sources to simultaneously initiate communications. 

Finally, the simulation results indicate that periodic sources can 
acquire a channel quickly enough to operate in a time-assignment, 
speech-interpolation mode. Storage and delay in the system may be 
used to reduce, and possibly eliminate, clipping which normally occurs 
at the beginning of active intervals in this type of system. 
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Low-Elevation-Angle Propagation Effects on 
COMSTAR Satellite Signals 


By J. M. TITUS* and H. W. ARNOLD 
(Manuscript received April 30, 1982) 


1. INTRODUCTION 


Little information exists on earth-space propagation at lower eleva- 
tion angles. One earlier experiment’ has shown greater fluctuation in 
signal levels over low-elevation paths. This signal fluctuation likely 
results from focusing and defocusing by inhomogeneities in the atmo- 
sphere’s index of refraction caused by turbulence. The fluctuation 
increases at lower elevation angles, then, because the signal must pass 
through more atmosphere. 

The opportunity existed during 1976 to make more low-elevation 
observations at 19 GHz as the comstTaR satellites were placed into 
position. This paper describes the acquisition and analysis of the low- 
elevation propagation data. Results are presented showing the fluctua- 
tion intensity of the received signal at several elevation angles. 


il. EXPERIMENT 


Two satellites were launched carrying 19-GHz beacon transmitters, 
COMSTAR I in May 1976, and COMSTAR II in June 1976. As they were 
placed in their final positions, they drifted upward from the horizon at 
a rate of 1 to 2 degrees per day. Original data were taken at 19 GHz on 
Crawford Hill for elevation angles from 1 to 10 degrees above the 
horizon with a 3.7-meter diameter antenna and an interim receiver.” 
Signals received on vertical and horizontal polarizations (relative to 
the Crawford Hill horizon) were recorded on analog magnetic tape. 


* This work was performed while Ms. Titus was a summer student. 
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The satellites were rotating slowly; therefore, the directional antenna 
on the satellites caused the received signal level to vary. There was 
very little variation over a period of a few minutes, however. Weather 
throughout the initial experiment was generally cloudy and overcast, 
but there was no significant precipitation. The 0.3-degree beamwidth 
of the Crawford Hill antenna precluded ground-reflected signals. Since 
both transmitter and receiver had excellent amplitude stability, at- 
mospheric turbulence was the only reasonable source of the signal- 
amplitude fluctuations observed. These fluctuations occurred almost 
all the time, as contrasted with the more infrequent multipath fading 
observed on terrestrial paths. 

On the analog magnetic tape, data samples were taken once every 
hour for ten consecutive minutes. For each 10-minute interval (corre- 
sponding to a particular elevation angle), three sets of 256 points, 
sampled at a rate of 5 points per second, were digitized and recorded 
under control of an HP 9830 calculator. Signal amplitudes for the two 
polarizations were sampled simultaneously. The three 51-second sam- 
ples of digitized signals were taken from the beginning of the ten- 
minute run unless some readily apparent problem existed on the chart 
data. 


ltl. RESULTS 


After data samples were digitized from all magnetic tapes, an anal- 
ysis was made to determine how elevation angle affected signal fluc- 
tuation. For each sample set, the mean and standard deviation of the 
received signal amplitude for each polarization and the correlation 
coefficient between the vertically and horizontally polarized signals 
were calculated. 

Some sample sets were unusable because of low signal levels, so a 
method was devised to locate and delete them. It was assumed that 
the receiver noise level was the same as that existing later in the 
experiment, when the signal-to-noise ratio was approximately 46 dB in 
clear air. From this assumption a graph was constructed that gave, as 
a function of mean receiver output in volts, the standard deviation-to- 
mean ratio expected from receiver noise alone. Each sample set was 
checked and deleted if its fluctuation was not at least two standard 
deviations higher than that expected from receiver noise alone. 

For all remaining sample sets, scatter plots of fluctuation (defined 
as the ratio of the standard deviation to the mean) vs. elevation angle 
were printed. Figure 1 shows an example for COMSTAR I using horizon- 
tal polarization. Results for the horizontal polarization from COMSTAR 
m1 and for the vertical polarization from both satellites were also 
plotted, and were very similar to the results shown. Multiple data exist 
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Fig. 1—19-GHz signal amplitude fluctuation vs. path elevation angle for COMSTAR I 
using horizontal polarization. 


for certain elevation angles, because in those cases the satellites were 
at nearly the same elevation angles for more than one hour. 

Figure 1 indicates that signal fluctuation increases dramatically as 
the path elevation angle decreases. A simple theory to explain this 
trend involves the notion that the strength of the fluctuation is 
dependent on the number of air masses through which the propagation 
path passes. The number of air masses is proportional to the integral 
of the density (kg/m*) of the atmosphere along the path. One air mass 
is defined as the density integral along a vertical path. 

The number of air masses as a function of elevation angle was 
calculated in two ways: (i) by assuming that the atmosphere can be 
modeled as evenly distributed over a height up to nine km, and (iz) by 
modeling the atmosphere as six layers from altitudes of 0 to 30 km 
with different densities.* The latter was done for both evenly spaced 
layers (steps of 5 km/layer) and unevenly spaced layers (steps of 2, 3, 
5, 6, 6 and 8 km/layer) to observe the effect of the higher density of 
the lower atmosphere at the very low angles. The results for the two 
methods are compared in Fig. 2 and show no significant differences. 
Therefore, the simpler method was used for the following results. Both 
methods depart radically from the cosecant angular dependence seen 


PROPAGATION EFFECTS 1569 


—— STEPPED-DENSITY MODEL 


—— FIXED-DENSITY MODEL 


ELEVATION ANGLE IN DEGREES 
o 





0 4 8 12 16 20 24 28 32 36 40 
NUMBER OF AIR MASSES 


Fig. 2—Path elevation angle vs. number of air masses for stepped-density and fixed- 
density atmospheric models. 


at higher elevation angles,* owing to the earth’s curvature. 

Figure 3 shows a scatter plot of signal fluctuation vs. number of air 
masses for the combined data from COMSTAR I and COMSTAR I. This 
figure shows that the fluctuation of the signal increases rapidly as the 
number of air masses increases. The functional form of this dependence 
cannot be determined from the available data, but appears to be at 
least linear. For very low-elevation-angle paths (1 to 1.25 degrees) 
traversing a large number of air masses (24), the signal fluctuation 
varies about 0.25. For a significantly smaller number of air masses (i.e., 
below 8), and relatively higher elevation angles (above 6.50 degrees), 
the fluctuation decreases to around 0.05. 

Results for both received polarizations show that fluctuations with 
number of air masses for the two polarizations are nearly identical. 
Instantaneous correlation coefficients between the polarizations were 
typically above 0.95 when the fluctuations were also large compared 
with the receiver noise level (i.e., at low-elevation angles). It thus 
appears that the cause of the fluctuation is not dependent on the state 
of the polarization—vertical or horizontal. This is the first known 
observation of this polarization independence for elevation angles 
below 5 degrees. 
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Fig. 3—19-GHz signal amplitude fluctuation vs. number of air masses along path for 
combined COMSTAR I and COMSTAR II data using horizontal polarization. 


IV. CONCLUSIONS 


Two conclusions can be drawn from this experiment. One is that the 
signal fluctuation is independent of polarization. Two facts lead to this 
conclusion: (z) there is a very high correlation between signal strengths 
at the two polarizations; and (iz) the results for fluctuation vs. number 
of air masses were nearly identical for the two polarizations. The 
second conclusion is that fluctuation intensity increases at lower 
elevation angles. This increase is consistent with the increase in the 
number of air masses along the path. 
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